I 



2 

GTT TTA CAC CAT CAC CAA AAC CCA GAA TTT 
val leu his his his gin asn pro glu phe 

62 

CAG CTG CAT GAA AAG CAC CAC CTG TTG CTC 
gin leu his glu lys his his leu leu leu 

122 

AGT AAA GGA AGC ACG AAG AAG AGG GAT GTC 
ser lys gly ser thr lys lys arg asp val 

182 

JCCC CTC CTG AAA GAC GGA AGG GTG GTG ACA 
"-Jjro leu leu lys asp gly arg val val thr 

ii? 42 

G&TT CCT TCG GGC TAT CTT GGC TAC CAA GAG 
NLeu pro ser gly tyr leu gly tyr gin glu 

WS02 

SKTT AAA TGG GTA GAT GGA GGC AAG CCA CTG 
a ile lys trp val asp gly gly lys pro leu 

y,GTG TAT ACT CAG GAT CAG CAT TTA CAT AAT 
2,h val tyr thr gin asp gin his leu his asn 

S 422 

^ GGA GCC CAA GCC TTA GGA AAC GAA CTT GTA 
gly ala gin ala leu gly asn glu leu val 

482 

GGC CAC GTG ATG ATC GCC TTC TTG CCC ACT 
gly his val met ile ala phe leu pro thr 

542 

AGA GCC ACA CAG GAA GAA GTC GCG GTT AAC 
arg ala thr gin glu glu val ala val asn 

602 

CAG TGC CAT GAG GAA GGA TTG GAG AGC CAC 
gin cys his glu glu gly leu glu ser his 

662 

GCT GAG CCA TAT GTT GCC TCT GAA TAC AAG 
ala glu pro tyr val ala ser glu tyr lys 



l 

A 

32 

TAT GAT GAG ATT AAA ATA GAG TTG CCC ACT 
tyr asp glu ile lys ile glu leu pro thr 

92 

ACA TTC TTC CAT GTC AGC TGT GAC AAC TCA 
thr phe phe his val ser cys asp asn ser 

152 

GTT GAA ACC CAA GTT GGC TAC TCC TGG CTT 
val glu thr gin val gly tyr ser trp leu 

212 

AGC GAG CAG CAC ATC CCG GTC TCG GCG AAC 
ser glu gin his ile pro val ser ala asn 

272 

CTT GGG ATG GGC AGG CAT TAT GGT CCG GAA 
leu gly met gly arg his tyr gly pro glu 

332 

CTG AAA ATT TCC ACT CAT CTG GTT TCT ACA 
leu lys ile ser thr his leu val ser thr 

3 92 

TTT TTC CAG TAC TGT CAG AAA ACC GAA TCT 
phe phe gin tyr cys gin lys thr glu ser 

452 

AAG TAC CTT AAG AGT CTG CAT GCG ATG GAA 
lys tyr leu lys ser leu his ala met glu 

512 

ATC CTA AAC CAG CTG TTC CGA GTC CTC ACC 
ile leu asn gin leu phe arg val leu thr 

572 

GTG ACT CGG GTC ATT ATT CAT GTG GTT GCC 
val thr arg val ile ile his val val ala 

632 

TTG AGG TCA TAT GTT AAG TAC GCG TAT AAG 
leu arg ser tyr val lys tyr ala tyr lys 

692 

ACA GTG CAT GAA GAA CTG ACC AAA TCC ATG 
thr val his glu glu leu thr lys ser met 



i 




722 

ACC ACG ATT CTC AAG CCT TCT GCC GAT TTC 
thr thr ile leu lys pro ser ala asp phe 

782 

TGG TTT TTC TTT GAT GTA CTG ATC AAA TCT 
trp phe phe phe asp val leu ile lys ser 

842 | Cadherin Cleavage | 

GTT AAG TTG CTG CGA AAC CAG AGA TTT CCT 
val lys leu leu arg asn gin arg phe pro 

902 

GTA AAT ATG CTG ATG CCA CAC ATC ACT CAG 
val asn met leu met pro his ile thr gin 

962 

AAC GCG AAT CAT AGC CTT GCT GTC TTC ATC 
asn ala asn his ser leu ala val phe ile 

%022 

^MbTT GTC TTC AAG CAG ATC AAC AAC TAC ATT 
^phe val phe lys gin ile asn asn tyr ile 

%t082 

OtCTC TTT GAA TAC AAG TTT GAA TTT CTC CGT 
LUleu phe glu tyr lys phe glu phe leu arg 

5 1142 

|^TTG AAC TTA CCA ATG CCA TTT GGA AAA GGC 
gleu asn leu pro met pro phe gly lys gly 

SSGAC TAC TCA TTA ACA GAT GAG TTC TGC AGA 
asp tyr ser leu thr asp glu phe cys arg 

xxx | 

GAG GTG GGG ACA GCC CTC CAG GAG TTC CGG 
glu val gly thr ala leu gin glu phe arg 

1322 

AAG AAC CTG CTG ATA AAG CAT TCT TTT GAT 
lys asn leu leu ile lys his ser phe asp 

1382 

AGG ATA GCC ACC CTC TAC CTG CCT CTG TTT 
arg ile ala thr leu tyr leu pro leu phe 

1442 

AAT GTG AGG GAT GTG TCA CCC TTC CCT GTG 
asn val arg asp val ser pro phe pro val 




752 

CTC ACC AGC AAC AAA CTA CTG AGG TAC TCA 
leu thr ser asn lys leu leu arg tyr ser 

812 

ATG GCT CAG CAT TTG ATA GAG AAC TCC AAA 
met ala gin his leu ile glu asn ser lys 

872 

GCA TCC TAT CAT CAT GCA GCG GAA ACC GTT 
ala ser tyr his his ala ala glu thr val 

932 

AAG TTT GGA GAT AAT CCA GAG GCA TCT AAG 
lys phe gly asp asn pro .glu ala ser lys 

992 

AAG AGA TGT TTC ACC TTC ATG GAC AGG GGC 
lys arg cys phe thr phe met asp arg gly 

1052 

AGC TGT TTT GCT CCT GGA GAC CCA AAG ACC 
ser cys phe ala pro gly asp pro lys thr 

1112 

GTA GTG TGC AAC CAT GAA CAT TAT ATT CCG 
val val cys asn his glu his tyr ile pro 

1172 

AGG ATT CAA AGA TAC CAA GAC CTC CAG CTT 
arg ile gin arg tyr gin asp leu gin leu 

1232 (Cadherin EC 

AAC CAC TTC TTG GTG GGA CTG TTA CTG AGG 
asn his phe leu val gly leu leu leu arg 

1292 

GAG GTC CGT CTG ATC GCC ATC AGT GTG CTC 
glu val arg leu ile ala ile ser val leu 

1352 

GAC AGA TAT GCT TCA AGG AGC CAT CAG GCA 
asp arg tyr ala ser arg ser his gin ala 

1412/471 

GGT CTG CTG ATT GAA AAC GTC CAG CGG ATC 
gly leu leu ile glu asn val gin arg ile 

1472 

AAC GCG GGC ATG ACC GTG AAG GAT GAA TCC 
asn ala gly met thr val lys asp glu ser 




1502 

CTG GCT CTA CCA GCT GTG AAT CCG CTG GTG 
leu ala leu pro ala val asn pro leu val 

1562 

AGC CTG CAC AAG GAC CTG CTG GGC GCC ATC 
ser leu his lys asp leu leu gly ala ile 

1622 

ACT CCA AAC ATC AAC AGT GTG AGA AAT GCT 
thr pro asn ile asn ser val arg asn ala 

1682 

TCG GGT AAC AGC CTT CCA GAA AGG AAT AGT 
ser gly asn ser leu pro glu arg asn ser 

1742 

CAA AGT AGC ACA TTG GGA AAT TCC GTG GTT 
gin ser ser thr leu gly asn ser val val 

M.802 

jjAAG AGC CTA CTG ATG TGT TTC CTC TAC ATC 
~lys ser i eu i eu met cys phe leu tyr ile 

SIL862 

HkCA TAT TGG AAC AAG GCT TCA ACA TCT GAA 
Mjthr tyr trp asn lys ala ser thr ser glu 

s 1922 

L^TGC CTG CAC CAG TTC CAG TAC ATG GGG AAG 
Qcys leu his gin phe gin tyr met gly lys 

tlJ 1982 

f£ GGA CCC ATA GTT CAT GAT CGA AAG TCT CAG 
i gly pro ile val his asp arg lys ser gin 

2042 

ATG ATG CAT GCC AGA TTG CAG CAG CTG GGC 
met met his ala arg leu gin gin leu gly 

2102 

AGC TAT GGC CAC TCG GAC GCA GAT GTT CTG 
ser tyr gly his ser asp ala asp val leu 

2162 

ACT GAG GTT TGC CTG ACA GCT CTG GAC ACG 
thr glu val cys leu thr ala leu asp thr 

2222 

CAG CTC CTG GCC GAC CAT GGA CAT AAT CCT 
gin leu leu ala asp his gly his asn pro 

2282 

TGT TTT CTT CAA AAA CAT CAG TCT GAA ACG 



1532 

ACG CCG CAG AAG GGA AGC ACC CTG GAC AAC 
thr pro gin lys gly ser thr leu asp asn 

1592 

TCC GGC ATT GCT TCT CCA TAT ACA ACC TCA 
ser gly ile ala ser pro tyr thr thr ser 

1652 

GAT TCG AGA GGA TCT CTC ATA AGC ACA GAT 
asp ser arg gly ser leu ile ser thr asp 

1712 

GAG AAG AGC AAT TCC CTG GAT AAG CAC CAA 
glu lys ser asn ser leu asp lys his gin 

1772 

CGC TGT GAT AAA CTT GAC CAG TCT GAG ATT 
arg cys asp lys leu asp gin ser glu ile 

1832 

TTA AAG AGC ATG TCT GAT GAT GCT TTG TTT 
leu lys ser met ser asp asp ala leu phe 

1892 

CTT ATG GAT TTT TTT ACA ATA TCT GAA GTC 
leu met asp phe phe thr ile ser glu val 

1952 

CGA TAC ATA GCC AGG AAC CAG GAG GGG TTG 
arg tyr ile ala arg asn gin glu gly leu 

2012 

ACA TTG CCT GTT TCC CGT AAC AGA ACA GGA 
thr leu pro val ser arg asn arg thr gly 

2072 

AGC CTG GAT AAC TCT CTC ACT TTT AAC CAC 
ser leu asp asn ser leu thr phe asn his 

2132 

CAC CAG TCA TTA CTT GAA GCC AAC ATT GCT 
his gin ser leu leu glu ala asn ile ala 

2192 

CTT TCT CTA TTT ACA TTG GCG TTT AAG AAC 
leu ser leu phe thr leu ala phe lys asn 

2252 

CTC ATG AAA AAA GTT TTT GAT GTC TAC CTG 

leu met lys lys val phe asp val tyr leu 
2312 

GCT TTA AAA AAT GTC TTC ACT GCC TTA AGG 



cys phe leu gin lys his gin ser glu thr 
2342 

TCC TTA ATT TAT AAG TTT CCC TCA ACA TTC 
ser leu ile tyr lys phe pro ser thr phe 

2402 

CTG TGT TAC GAG ATT CTC AAG TGC TGT AAC 
leu cys tyr glu ile leu lys cys cys asn 

2462 

TCC CAG CTG CTC TAC TTC CTG ATG AGG AAC 
ser gin leu leu tyr phe leu met arg asn 

2522 

GTC CGG ACA CAT TTG CAA GTC ATC ATA TCT 
val arg thr his leu gin val ile ile ser 

2582 

^ATT GGG GAA ACC AGA TTC CAG CAG TCC CTG 
=|Lle gly glu thr arg phe gin gin ser leu 

^2642 

&GG CTT ATT AAG CAC ACC AGC TTC TCC TCT 
~4rg leu ile lys his thr ser phe ser ser 

J2702 

d^CG GTG CTA ATG GCC ACC GCC CAG ATG AAG 
thr val leu met ala thr ala gin met lys 

12762 

IgAC CTC CAG TAC AGC CTG GCC AAA TCC TAT 
^asp leu gin tyr ser leu ala lys ser tyr 



ala leu lys asn val phe thr ala leu arg 
2372 

TAT GAA GGG AGA GCG GAC ATG TGT GCG GCT 
tyr glu gly arg ala asp met cys ala ala 

2432 

TCC AAG CTG AGC TCC ATC AGG ACG GAG GCC 
ser lys leu ser ser ile arg thr glu ala 

2492 

AAC TTT GAT TAC ACT GGA AAG AAG TCC TTT 
asn phe asp tyr thr gly lys lys ser phe 

2552 

GTC AGC CAG CTG ATA GCA GAC GTT GTT GGC 
val ser gin leu ile ala asp val val gly 

2612 

TCC ATC ATC AAC AAC TGT GCC AAC AGT GAC 
ser ile ile asn asn cys ala asn ser asp 

2672 

GAT GTG AAG GAC TTA ACC AAA AGG ATA CGC 
asp val lys asp leu thr lys arg ile arg 

2732 

GAG CAT GAG AAC GAC CCA GAG ATG CTG GTG 
glu his glu asn asp pro glu met leu val 

2792 

GCC AGC ACG CCC GAG CTC AGG AAG ACG TGG 
ala ser thr pro glu leu arg lys thr trp 



2822 2852 | xxxxxxxxxxxxxxx Predicted 

CTC GAC AGC ATG GCC AGG ATC CAT GTC AAA AAT GGC GAT CTC TCA GAG GCA GCA ATG TGC 
leu asp ser met ala arg ile his val lys asn gly asp leu ser glu ala ala met cys 

Transmembrane Domain xxxxxxxxxxxxxxxxxxxxxxxxxx | 

TAT GTC CAC GTA ACA GCC CTA GTG GCA GAA TAT CTC ACA CGG AAA GGC GTG TTT AGA CAA 
tyr val his val thr ala leu val ala glu tyr leu thr arg lys gly val phe arg gin 

2942 2972 

GGA TGC ACC GCC TTC AGG GTC ATT ACC CCA AAC ATC GAC GAG GAG GCC TCC ATG ATG GAA 
gly cys thr ala phe arg val ile thr pro asn ile asp glu glu ala ser met met glu 

3002 3032 

GAC GTG GGG ATG CAG GAT GTC CAT TTC AAC GAG GAT GTG CTG ATG GAG CTC CTT GAG CAG 
asp val gly met gin asp val his phe asn glu asp val leu met glu leu leu glu gin 

3062 3092 

TGC GCA GAT GGA CTC TGG AAA GCC GAG CGC TAC GAG CTC ATC GCC GAC ATC TAC AAA CTT 



cys ala asp gly leu trp lys ala glu arg 
3122 

ATC ATC CCC ATT TAT GAG AAG CGG AGG GAT 
ile ile pro ile tyr glu lys arg arg asp 

3182 

ATT TAC AAG GAA CCC AAA CTC ACA CCG CTG 
ile tyr lys glu pro lys leu thr pro leu 

3242 

TAC TCG GAT AAA TTT GGT TCT GAA AAT GTC 
tyr ser asp lys phe gly ser glu asn val 

3302 

CCT AAG GAT CTG GAT TCT AAG TAT GCA TAC 
pro lys asp leu asp ser lys tyr ala tyr 

3362 

J3AC GAA AAA GAG TTG CAA GAA AGG AAA ACA 

Cgsp glu lys glu leu gin glu arg lys thr 
53422 

SItC ATG TTT GAG ATG CCA TTT ACG CAG ACC 
^iphe met phe glu met pro phe thr gin thr 

y3482 

%jGC AAA CGG CGC ACC ATC CTG ACA GCC ATA 
5 cys lys arg arg thr ile leu thr ala ile 



§3542 

SfcCT GTC ATG TAC CAG CAC CAC ACT GAC CTG 
rjpro val met tyr gin his his thr asp leu 

3602 xxx Coiled-Coil 1 xxxxxxxxxxxxx 
AGT AAG AAG GTG GCG GAG CTC CGG CAG CTG 
ser lys lys val ala glu leu arg gin leu 

3662 xxxxxxxxxxxxxxxxxxxxxx | 

CTG CAG CTC AAA CTC CAG GGC AGC GTG AGT 

leu gin leu lys leu gin gly ser val ser 

3722 

GCG CGA GCT TTC TTA GAT GAT ACA AAC ACA 
ala arg ala phe leu asp asp thr asn thr 

3782 

CTT AAG GAA GTT TTC AGG CAA TTT GTG GAA 
leu lys glu val phe arg gin phe val glu 

3842 xxx Coiled-Coil 2 xxxxxxxxxxxxx 
CGT CTG ATT AAA GAA GAC CAG CTC GAG TAT 



tyr glu leu ile ala asp ile tyr lys leu 
3155 

TTC TTT GAA GAT GAA GAT GGA AAG GAG TAT 
phe phe glu asp glu asp gly lys glu tyr 

3212 

TCG GAA ATT TCT CAG AGA CTC CTT AAA CTG 
ser glu ile ser gin arg leu leu lys leu 

3272 

AAA ATG ATA CAG GAT TCT GGC AAG GTC AAC 
lys met ile gin asp ser gly lys val asn 

3332 

ATC CAG GTG ACT CAC GTC ATC CCC TTC TTT 
ile gin val thr his val ile pro phe phe 

3392 

GAG TTT GAG AGA TCC CAC AAC ATC CGC CGC 
glu phe glu arg ser his asn ile arg arg 

3452 

GGG AAG AGG CAG GGC GGG GTG GAA GAG CAG 
gly lys arg gin gly gly val glu glu gin 

3512 

CAC TGC TTC CCT TAT GTG AAG AAG CGC ATC 
his cys phe pro tyr val lys lys arg ile 



3572 | xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
AAC CCC ATC GAG GTG GCC ATT GAC GAG ATG 
asn pro ile glu val ala ile asp glu met 

3632 xxxx Coiled-Coil 1 xxxxxxxxxxxxx 
TGC TCC TCG GCC GAG GTG GAC ATG ATC AAA 
cys ser ser ala glu val asp met ile lys 

3692 

GTT CAG GTC AAT GCT GGC CCA CTA GCA TAT 
val gin val asn ala gly pro leu ala tyr 

3752 

AAG CGA TAT CCT GAC AAT AAA GTG AAG CTG 
lys arg tyr pro asp asn lys val lys leu 

3812 | xxxxxxxxxxxxxxxxxx 

GCT TGC GGT CAA GCC TTA GCG GTA AAC GAA 
ala cys gly gin ala leu ala val asn glu 

3 872 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
CAG GAA GAA ATG AAA GCC AAC TAC AGG GAA 



f \6{. 1 ftwrt) 



arg leu ile lys glu asp gin leu glu tyr gin glu glu met lys ala asn tyr arg glu 



3902 xxx Coiled-Coil 2 xxxxxxxxxxxxx 3932 xxxxx) 

ATG GCG AAG GAG CTT TCT GAA ATC ATG CAT GAG CAG ATC TGC CCC CTG GAG GAG AAG ACG 
met ala lys glu leu ser glu ile met his glu gin ile cys pro leu glu glu lys thr 

3962 3992 

AGC GTC TTA CCG AAT TCC CTT CAC ATC TTC AAC GCC ATC AGT GGG ACT CCA ACA AGC ACA 
ser val leu pro asn ser leu his ile phe asn ala ile ser gly thr pro thr ser thr 

4 022 |xxxx PBM xxxxj 

ATG GTT CAC GGG ATG ACC AGC TCG TCT TCG GTC GTG TGA TTA CAT CTC ATG GCC CGT GTG 
met val his gly met thr ser ser ser ser val val STP 

4082 4112 

TGG GGA CTT GCT TTG TCA TTT GCA AAC TCA GGA TGC TTT CCA AAG CCA ATC ACT GGG GAG 
4142 4172 

ACC GAG CAC AGG GAG GAC CAA GGG GAA GGG GAG AGA AAG GAA ATA AAG AAC AAC GTT ATT 
^202 4232 

JfCT TAA CAG ACT TTC TAT AGG AGT TGT AAG AAG GTG CAC ATA TTT TTT TAA ATC TCA CTG 
^262 4292 

jiCA ATA TTC AAA GTT TTC ATT GTG TCT TAA CAA AGG TGT GGT AGA CAC TCT TGA GCT GGA 
U4322 4352 

%£TT AGA TTT TAT TCT TCC TTG CAG AGT AGT GTT AGA ATA GAT GGC CTA CAG AAA AAA AAG 
M4382 4412 

Q3TT CTG GGA TCT ACA TGG CAG GGA GGG CTG CAC TGA CAT TGA TGC CTG GGG GAC CTT TTG 
Li? 442 4472 

p5rCT CGA CTC GTG CCG GAA ATC TGA TCG TAA TCA GGG TAC AGA ACT TAC TAG TTT TGT CTA 

^4502 4532 
GGA GTA TGT TGT ATG ACT AGG ATT TGT GCT ATT ATC TCA TTC AAC AAC ATA GAG CAA GAA 

4562 4592 

TAG TGA GCT AAC TGA GCT AGA CAC TCA ATT AAT CCG CTA CTG GCT TCA AGT CAG AAC TTT 
4622 4652 

GTC ATT AAT CAT CGA CTC CGG GAC GGT CAT ATA TGT ATT ACA TTT CTA CAT TTT TAA TAC 
4682 4712 

TCA CAT GGG CTT ATG CAT TAA GTT TAA TTG TGA TAA ATT TGT GCT GGT CCA GTA TAT GCA 



4742 4772 

ATA CAC TTT AAT GGT TTA TTC TTG TCA TAA AAA TGT GCA ATA TGG AGA TGT ATA CAA GTC 

4802 
TTT ACT 
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2 

GTT TTA CAC CAT CAC CAA AAC CCA GAA TTT 
val leu his his his gin asn pro glu phe 

b2 

CAG CTG CAT GAA AAG CAC CAC CTG TTG CTC 
gin leu his glu lys his his leu leu leu 

122 

AGT AAA GGA AGC ACG AAG AAG AGG GAT GTC 
ser lys gly ser thr lys lys arg asp val 

152 

CCC CTC CTG AAA GAC GGA AGG GTG GTG ACA 
j)ro leu leu lys asp gly arg val val thr 

ftM2 

^tTT CCT TCG GGC TAT CTT GGC TAC CAA GAG 
^leu pro ser gly tyr leu gly tyr gin glu 

003Q2 

Watt aaa tgg gta gat gga ggc aag cca ctg 

%Jile lys trp val asp gly gly lys pro leu 
U3b2 

OGTG TAT ACT CAG GAT CAG CAT TTA CAT A AT 

£Ival tyr thr gin asp gin his leu his asn 

2^22 

if GGA GCC CAA GCC TTA GGA AAC GAA CTT GTA 
^'gly ala gin ala leu gly asn glu leu val 

M&2 

GGC CAC GTG ATG ATC GCC TTC TTG CCC ACT 
gly his val met ile ala phe leu pro thr 

5M2 

AGA GCC ACA CAG GAA GAA GTC GCG GTT AAC 
arg ala thr gin glu glu val ala val asn 

bQ2 

CAG TGC CAT GAG GAA GGA TTG GAG AGC CAC 
gin cys his glu glu gly leu glu ser his 

bb2 

GCT GAG CCA TAT GTT GCC TCT GAA TAC AAG 
ala glu pro tyr val ala ser glu tyr lys 




1 
A 

32 

TAT GAT GAG ATT AAA ATA GAG TTG CCC ACT 
tyr asp glu ile lys ile glu leu pro thr 

=12 

ACA TTC TTC CAT GTC AGC TGT GAC AAC TCA 
thr phe phe his val ser cys asp asn ser 

152 

GTT GAA ACC CAA GTT GGC TAC TCC TGG CTT 
val glu thr gin val gly tyr ser trp leu 

212 

AGC GAG CAG CAC ATC CCG GTC TCG GCG AAC 
ser glu gin his ile pro val ser ala asn 

2?2 

CTT GGG ATG GGC AGG CAT TAT GGT CCG GAA 
leu gly met gly arg his tyr gly pro glu 

332 

CTG AAA ATT TCC ACT CAT CTG GTT TCT ACA 
leu lys ile ser thr his leu val ser thr 

3^2 

TTT TTC CAG TAC TGT CAG AAA ACC GAA TCT 
phe phe gin tyr cys gin lys thr glu ser 

M52 

AAG TAC CTT AAG AGT CTG CAT GCG ATG GAA 
lys tyr leu lys ser leu his ala met glu 

512 

ATC CTA AAC CAG CTG TTC CGA GTC CTC ACC 
ile leu asn gin leu phe arg val leu thr 

S72 

GTG ACT CGG GTC ATT ATT CAT GTG GTT GCC 
val thr arg val ile ile his val val ala 

b32 

TTG AGG TCA TAT GTT AAG TAC GCG TAT AAG 
leu arg ser tyr val lys tyr ala tyr lys 

b c J2 

ACA GTG CAT GAA GAA CTG ACC AAA TCC ATG 
thr val his glu glu leu thr lys ser met 



FI61.Z6 




722 

A<C ACG ATT CTC AAG CCT TCT GCC GAT TTC 
thr thr ile leu lys pro ser ala asp phe 

7fi2 

TGG TTT TTC TTT GAT GTA CTG ATC AAA TCT 
trp phe phe phe asp val leu ile lys ser 

BM2 ICadherln Cleavage! 

GTT AAG TTG CTG CGA AAC CAG AGA TTT CCT 

val lys leu leu arg asn gin arg phe pro 

"102 

GTA A AT ATG CTG ATG CCA CAC ATC ACT CAG 
val asn met leu met pro his ile thr gin 

AAC GCG AAT CAT AGC CTT GCT GTC TTC ATC 
asn ala asn his ser leu ala val phe ile 

Cl022 

#TT GTC TTC AAG CAG ATC AAC AAC TAC ATT 
O^he val phe lys gin ile asn asn tyr ile 

%40fi2 

OJTC TTT GAA TAC AAG TTT GAA TTT CTC CGT 
IdLeu phe glu tyr lys phe glu phe leu arg 

,1142 

yTTG AAC TTA CCA ATG CCA TTT GGA AAA GGC 
? 4eu asn leu pro met pro phe gly lys gly 

f^L2G2 

JiGAC TAC TCA TTA ACA GAT GAG TTC TGC AGA 
!zasp tyr ser leu thr asp glu phe cys arg 

XXX I 

GAG GTG GGG ACA GCC CTC CAG GAG TTC CGG 
glu val gly thr ala leu gin glu phe arg 

1322 

AAG AAC CTG CTG ATA AAG CAT TCT TTT GAT 
lys asn leu leu ile lys his ser phe asp 

1362 

AGG ATA GCC ACC CTC TAC CTG CCT CTG TTT 
arg ile ala thr leu tyr leu pro leu phe 

1442 

AAT GTG AGG GAT GTG TCA CCC TTC CCT GTG 
asn val arg asp val ser pro phe pro val 



4 

7S2 

CTC ACC AGC AAC AAA CTA CTG AGG TAC TCA 
leu thr ser asn lys leu leu arg tyr ser 

612 

ATG GCT CAG CAT TTG ATA GAG AAC TCC AAA 
met ala gin his leu ile glu asn ser lys 

fl?2 

GCA TCC TAT CAT CAT GCA GCG GAA ACC GTT 
ala ser tyr his his ala ala glu thr val 

T32 

AAG TTT GGA GAT AAT CCA GAG GCA TCT AAG 
lys phe gly asp asn pro glu ala ser lys 

*n2 

AAG AGA TGT TTC ACC TTC ATG GAC AGG GGC 
lys arg cys phe thr phe met asp arg gly 

1052 

AGC TGT TTT GCT CCT GGA GAC CCA AAG ACC 
ser cys phe ala pro gly asp pro lys thr 

1112 

GTA GTG TGC AAC CAT GAA CAT TAT ATT CCG 
val val cys asn his glu his tyr ile pro 

1172 

AGG ATT CAA AGA TAC CAA GAC CTC CAG CTT 
arg ile gin arg tyr gin asp leu gin leu 

1232 ICadherin EC 

AAC CAC TTC TTG GTG GGA CTG TTA CTG AGG 
asn his phe leu val gly leu leu leu arg 

12^2 

GAG GTC CGT CTG ATC GCC ATC AGT GTG CTC 
glu val arg leu ile ala ile ser val leu 

1352 

GAC AGA TAT GCT TCA AGG AGC CAT CAG GCA 
asp arg tyr ala ser arg ser his gin ala 

1M12 

GGT CTG CTG ATT GAA AAC GTC CAG CGG ATC 
gly leu leu ile glu asn val gin arg ile 

1472 

AAC GCG GGC ATG ACC GTG AAG GAT GAA TCC 
asn ala gly met thr val lys asp glu ser 



15G2 1532 

CTG GCT CTA CCA GCT GTG AAT CCG CTG GTG ACG CCG CAG A AG GGA AGC ACC CTG G AC AAC 

leu ala leu pro ala val asn pro leu val thr pro gin lys gly ser thr leu asp asn 

15b2 15^2 

AGC CTG CAC AAG G AC CTG CTG GGC GCC ATC TCC GGC ATT GCT TCT CCA TAT ACA ACC TCA 

ser leu his lys asp leu leu gly ala ile ser gly ile ala ser pro tyr thr thr ser 

3.1)22 lbS2 

ACT CCA AAC ATC AAC AGT GTG AGA AAT GCT GAT TCG AGA GGA TCT CTC ATA AGC ACA GAT 

thr pro asn ile asn ser val arg asn ala asp ser arg gly ser leu ile ser thr asp 

lbB2 1712 

TCG GGT AAC AGC CTT CCA GAA AGG AAT AGT GAG AAG AGC AAT TCC CTG GAT AAG CAC CAA 

ser gly asn ser leu pro glu arg asn ser glu lys ser asn ser leu asp lys his gin 

17M2 1772 

CAA AGT AGC ACA TTG GGA AAT TCC GTG GTT CGC TGT GAT AAA CTT GAC CAG TCT GAG ATT 

gin ser ser thr leu gly asn ser val val arg cys asp lys leu asp gin ser glu ile 

QfiD2 lfi32 

M£AG AGC CTA CTG ATG TGT TTC CTC TAC ATC TTA AAG AGC ATG TCT GAT GAT GCT TTG TTT 

Uiys ser leu leu met cys phe leu tyr ile leu lys ser met ser asp asp ala leu phe 



^iflt»2 16^2 

S&CA TAT TGG AAC AAG GCT TCA ACA TCT GAA CTT ATG GAT TTT TTT ACA ATA TCT GAA GTC 
ythr tyr trp asn lys ala ser thr ser glu leu met asp phe phe thr ile ser glu val 

s 1^22 1T52 I xxxxxxxxxxxxxxxxxxxx 

yTGC CTG CAC CAG TTC CAG TAC ATG GGG AAG CGA TAC ATA GCC AGG AAC CAG GAG GGG TTG 
pcys leu his gin phe gin tyr met gly lys arg tyr ile ala arg asn gin glu gly leu 

fTiTfl2 xxxxxxxxxx deleted in CLASP-2D (KIAA1058) xxxxxxxxxxxxxxxxxxxxxx I 

jif»GA CCC ATA GTT CAT GAT CGA AAG TCT CAG ACA TTG CCT GTT TCC CGT AAC AGA ACA GGA 

SSly pro ile val his asp arg lys ser gin thr leu pro val ser arg asn arg thr gly 

2D42 2072 

ATG ATG CAT GCC AGA TTG CAG CAG CTG GGC AGC CTG GAT AAC TCT CTC ACT TTT AAC CAC 
met met his ala arg leu gin gin leu gly ser leu asp asn ser leu thr phe asn his 

21D2 2132 

AGC TAT GGC CAC TCG GAC GCA GAT GTT CTG CAC CAG TCA TTA CTT GAA GCC AAC ATT GCT 
ser tyr gly his ser asp ala asp val leu his gin ser leu leu glu ala asn ile ala 

Deleted 

21b2 2112 Ixxx 

ACT GAG GTT TGC CTG ACA GCT CTG GAC ACG CTT TCT CTA TTT ACA TTG GCG TTT AAG AAC 
thr glu val cys leu thr ala leu asp thr leu ser leu phe thr leu ala phe lys asn 

in HC28 

xxx I 22S2 

CAG CTC CTG GCC GAC CAT GGA CAT AAT CCT CTC ATG AAA AAA GTT TTT GAT GTC TAC CTG 

gin leu leu ala asp his gly his asn pro leu met lys lys val phe asp val tyr leu 



2262 2312 

TGT TTT CTT CAA AAA CAT C AG TCT GAA ACG GCT TTA AAA AAT GTC TTC ACT GCC TTA AGG 
cys phe leu gin lys his gin ser glu thr ala leu lys asn val phe thr ala leu arg 

23^42 2372 

TCC TTA ATT TAT AAG TTT CCC TCA ACA TTC TAT GAA GGG AGA GCG GAC ATG TGT GCG GCT 
ser leu ile tyr lys phe pro ser thr phe tyr glu gly arg ala asp met cys ala ala 

2M02 2435 

CTG TGT TAC GAG ATT CTC AAG TGC TGT AAC TCC AAG CTG AGC TCC ATC AGG ACG GAG GCC 
leu cys tyr glu ile leu lys cys cys asn ser lys leu ser ser ile arg thr glu ala 

2Mb2 2MT2 

TCC CAG CTG CTC TAC TTC CTG ATG AGG AAC AAC TTT GAT TAC ACT GGA AAG AAG TCC TTT 
ser gin leu leu tyr phe leu met arg asn asn phe asp tyr thr gly lys lys ser phe 

2S22 2552 

GTC CGG ACA CAT TTG CAA GTC ATC ATA TCT GTC AGC CAG CTG ATA GCA GAC GTT GTT GGC 
val arg thr his leu gin val ile ile ser val ser gin leu ile ala asp val val gly 

11562 2bl2 

y|TT GGG GAA ACC AGA TTC CAG CAG TCC CTG TCC ATC ATC AAC AAC TGT GCC AAC AGT GAC 
Ofile gly glu thr arg phe gin gin ser leu ser ile ile asn asn cys ala asn ser asp 

sifc,M2 2b72 

ggGG CTT ATT AAG CAC ACC AGC TTC TCC TCT GAT GTG AAG GAC TTA ACC AAA AGG ATA CGC 
ijrg leu ile lys his thr ser phe ser ser asp val lys asp leu thr lys arg ile arg 

„2?02 2732 

L*CG GTG CTA ATG GCC ACC GCC CAG ATG AAG GAG CAT GAG AAC GAC CCA GAG ATG CTG GTG 
Lthr val leu met ala thr ala gin met lys glu his glu asn asp pro glu met leu val 



I 27b2 27^2 

SAC CTC CAG TAC AGC CTG GCC AAA TCC TAT GCC AGC ACG CCC GAG CTC AGG AAG ACG TGG 
!z^sp leu gin tyr ser leu ala lys ser tyr ala ser thr pro glu leu arg lys thr trp 

2fi22 2852 I xxxxxxxxxxxxxxx Predicted 

CTC GAC AGC ATG GCC AGG ATC CAT GTC AAA AAT GGC GAT CTC TCA GAG GCA GCA ATG TGC 
leu asp ser met ala arg ile his val lys asn gly asp leu ser glu ala ala met cys 

([Additional and differential exon usage found at position 2T27 consisting 
of fciT nucleotides- This entire sequence is found in Human CLASP— 2D 

(KIAAlDSfl) and not other isoforms of CLASP-2- It has a sequence of ' 
AAGCAGTCCAGTGGGAGCCGCCCCTTCTCCCCCACAGCCATAGCGCCTGCCTGAGGAGGAGCCGGGGAG3 

Transmembrane Domain xxxxxxxxxxxxxxxxxxxxxxxxxx I 
TAT GTC CAC GTA ACA GCC CTA GTG GCA GAA TAT CTC ACA CGG AAA ilGC GTG TTT AGA CAA 
tyr val his val thr ala leu val ala glu tyr leu thr arg lys gly val phe arg gin 

2TM2 2=172 

GGA TGC ACC GCC TTC AGG GTC ATT ACC CCA AAC ATC GAC GAG GAG GCC TCC ATG ATG GAA 
gly cys thr ala phe arg val ile thr pro asn ile asp glu glu ala ser met met glu 



30D2 I xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx Sequence deleted in CLASP-2E xxxxx 

GAC GTG GGG ATG CAG GAT GTC CAT TTC AAC GAG GAT GTG CTG ATG GAG CTC CTT GAG CAG 
asp val gly met gin asp val his phe asn glu asp val leu met glu leu leu glu gin 

3QL2 xxxxxxxxxxxxxxxxxx I 3CH2 

TGC GCA GAT GGA CTC TGG AAA GCC GAG CGC TAC GAG CTC ATC GCC GAC ATC TAC AAA CTT 
cys ala asp gly leu trp lys ala glu arg tyr glu leu ile ala asp ile tyr lys leu 

([Additional and differential exon usage found at position 3153- The 
entire sequence below is found in Human GLASP-2D • Underlined sequence is 
found in Human CIASP-2B, 2C and 2E- 

TG AGAGGCTGGCCC ATCTGT ATGAC ACGCTGCACCGGGCCTACAGC AA AGTGACCG AGGTCAT 
GC ACTCGGGCCGC AGGCTTCTGGGGACCTACTTCCGGGTAGCCTTCTTCGGGCAGGCAGCGCAATACCAGTTT 
ACAGACAGTGAAAC AGATGTGGAGGGATTJ 

3122 T 3 15S 

ATC ATC CCC ATT TAT GAG AAG CGG AGG GAT TTC TTT GAA GAT GAA GAT GGA AAG GAG TAT 
ile ile pro ile tyr glu lys arg arg asp phe phe glu asp glu asp gly lys glu tyr 

C31A2 3512 

OlATT TAC AAG GAA CCC AAA CTC ACA CCG CTG TCG GAA ATT TCT CAG AGA CTC CTT AAA CTG 
OOile tyr lys glu pro lys leu thr pro leu ser glu ile ser gin arg leu leu lys leu 

0Q3242 3272 

txJTAC TCG GAT AAA TTT GGT TCT GAA AAT GTC AAA ATG ATA CAG GAT TCT GGC AAG GTC AAC 
Sjtyr ser asp lys phe gly ser glu asn val lys met ile gin asp ser gly lys val asn 

L.3302 3332 

pIcCT AAG GAT CTG GAT TCT AAG TAT GCA TAC ATC CAG GTG ACT CAC GTC ATC CCC TTC TTT 
Qpro lys asp leu asp ser lys tyr ala tyr ile gin val thr his val lie pro phe phe 

I^33ti2 33T2 

y GAC GAA AAA GAG TTG CAA GAA AGG AAA ACA GAG TTT GAG AGA TCC CAC AAC ATC CGC CGC 
^asp glu lys glu leu gin glu arg lys thr glu phe glu arg ser his asn ile arg arg 

3M22 3452 

TTC ATG TTT GAG ATG CCA TTT ACG CAG ACC GGG AAG AGG CAG GGC GGG GTG GAA GAG CAG 
phe met phe glu met pro phe thr gin thr gly lys arg gin gly gly val glu glu gin 

34fl2 3512 

TGC AAA CGG CGC ACC ATC CTG ACA GCC ATA CAC TGC TTC CCT TAT GTG AAG AAG CGC ATC 
cys lys arg arg thr ile leu thr ala ile his cys phe pro tyr val lys lys arg ile 

Two nucleotide deletion (nts 35flb and 35A7) found in Human CLASP-2C 



1 



3542 3572 I xxx I 

CCT GTC ATG TAC CAG CAC CAC ACT GAC CTG AAC CCC ATC GAG GTG GCC ATT GAC GAG AT£ 
pro val met tyr gin his his thr asp leu asn pro ile glu val ala ile asp glu met 



4 . 



i 



3bD2 3b32 

AGT AAG AAG GTG GCG GAG CTC CGG CAG CTG TGC TCC TCG GCC GAG GTG GAC ATG ATC AAA 
ser lys lys val ala glu leu arg gin leu cys ser ser ala glu val asp met ile lys 

3bb2 3b^2 

CTG CAG CTC AAA CTC CAG GGC AGC GTG AGT GTT CAG GTC AAT GCT GGC CCA CTA GCA TAT 
leu gin leu lys leu gin gly ser val ser val gin val asn ala gly pro leu ala tyr 

3722 37S2 

GCG CGA GCT TTC TTA GAT GAT ACA AAC ACA AAG CGA TAT CCT GAC AAT AAA GTG AAG CTG 
ala arg ala phe leu asp asp thr asn thr lys arg tyr pro asp asn lys val lys leu 

37A2 3fil2 

CTT AAG GAA GTT TTC AGG CAA TTT GTG GAA GCT TGC GGT CAA GCC TTA GCG GTA AAC GAA 
leu lys glu val phe arg gin phe val glu ala cys gly gin ala leu ala val asn glu 

3642 3672 

CGT CTG ATT AAA GAA GAC CAG CTC GAG TAT CAG GAA GAA ATG AAA GCC AAC TAC AGG GAA 
arg leu ile lys glu asp gin leu glu tyr gin glu glu met lys ala asn tyr arg glu 

CInsertion 0 f nucleotides found only in Human CLASP-2D with sequence: CTGGGATG 



03^02 3^32 

SJtTG GCG AAG GAG CTT TCT GAA ATC ATG CAT GAG CAG T ATC TGC CCC CTG GAG GAG AAG ACG 
ffmet ala lys glu leu ser glu ile met his glu gin ile cys pro leu glu glu lys thr 

vi3^b2 3^2 

^AGC GTC TTA CCG AAT TCC CTT CAC ATC TTC AAC GCC ATC AGT GGG ACT CCA ACA AGC ACA 
^ser val leu pro asn ser leu his ile phe asn ala ile ser gly thr pro thr ser thr 

^4 022 Ixxxx PBM xxxxl 

j">TG GTT CAC GGG ATG ACC AGC TCG TCT TCG GTC GTG TGA TTA CAT CTC ATG GCC CGT GTG 
Jifmet val his gly met thr ser ser ser ser val val STP 

OqQfi2 M112 

TGG GGA CTT GCT TTG TCA TTT GCA AAC TCA GGA TGC TTT CCA AAG CCA ATC ACT GGG GAG 

4142 4172 

ACC GAG CAC AGG GAG GAC CAA GGG GAA GGG GAG AGA AAG GAA ATA AAG AAC AAC GTT ATT 
4202 M232 

TCT TAA CAG ACT TTC TAT AGG AGT TGT AAG AAG GTG CAC ATA TTT TTT TAA ATC TCA CTG 
42b2 4212 

GCA ATA TTC AAA GTT TTC ATT GTG TCT TAA CAA AGG TGT GGT AGA CAC TCT TGA GCT GGA 
4322 4352 

CTT AGA TTT TAT TCT TCC TTG CAG AGT AGT GTT AGA ATA GAT GGC CTA CAG AAA AAA AAG 
4362 4412 

GTT CTG GGA TCT ACA TGG CAG GGA GGG CTG CAC TGA CAT TGA TGC CTG GGG GAC CTT TTG 
4442 4472 



4 



CCT CGA CTC GTG CCG GAA ATC TGA TCG TAA 
4502 

GGA GTA TGT TGT ATG ACT AGG ATT TGT GCT 
MSbS 

TAG TGA GCT AAC TGA GCT AGA CAC TCA ATT 
4t22 

GTC ATT AAT CAT CGA CTC CGG GAC GGT CAT 
4bB2 

TCA CAT GGG CTT ATG CAT TAA GTT TAA TTG 
4742 

ATA CAC TTT AAT GGT TTA TTC TTG TCA TAA 

4602 
TTT ACT 




TCA GGG TAC AGA ACT TAC TAG TTT TGT CTA 
4532 

ATT ATC TCA TTC AAC AAC ATA GAG CAA GAA 
45^2 

AAT CCG CTA CTG GCT TCA AGT CAG AAC TTT 
4t52 

ATA TGT ATT ACA TTT CTA CAT TTT TAA TAC 
4712 

TGA TAA ATT TGT GCT GGT CCA GTA TAT GCA 
4772 

AAA TGT GCA ATA TGG AGA TGT ATA CAA GTC 



1 



I 



4 



HC2A 

HC2-80 

HC2B 

HC2C 

HC2D- KIAA1 058 GCATCTGGAAATCTTGACAAAAATGCCAGATTTTCTGCCATCTACAGGCAAGACAGCAAT 

HC2E 

HC2F 



HC2A 

HC2-80 

HC2B 

HC2C 

HC2D- KIAA1 058 AAGCTATCCAATGATGACATGCTCAAGTTACTTGCAGACTTTCGGAAACCTGAGAAGATG 

HC2E 

HC2F 

S$C2 A ■ 

ftC2-80 

^-ii C 2 B — ~ — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — — _ _ _ _ _ _ _ _» _ 

*itC2C 

~lkc2D - KIAA1 058 GCTAAGCTCCCAGTGATTTTAGGCAATCTAGACATTACAATTGATAATGTTTCCTCAGAC 

%C2E 

=^1C2F 



|^HC2A 

f~pC2-80 '■ 

pHC2B 

fHC2 C 

^HC2D-KIAA1 058 TTCCCTAATTATGTTAATTCATCATACATTCCCACAAAACAATTTGAAACCTGCAGTAAA 

MHC2E 

PHC2F : 



HC2A 

HC2-80 

HC2B 

HC2C 

HC2D- KIAA1 058 ACTCCCATCACGTTTGAAGTGGAGGAATTTGTGCCCTGCATACCAAAACACACTCAGCCT 

HC2E 

HC2F 



HC2A 

HC2-80 

HC2B 

HC2C . : 

HC2D- KIAA1 0 5 8 TACACCATCTACACCAATCACCTTTACGTTTATCCTAAGTACTTGAAATACGACAGTCAG 

HC2E 

HC2F 



HC2A 

HC2-80 

HC2B 

HC2C ; 

HC2D- KIAA1 058 AAGTCTTTTGCCAAGGCTAGAAATATTGCGATTTGCATTGAATTCAAAGATTCAGATGAG 

HC2E 

HC2F 



HC2A 
HC2-80 
HC2B 
^HC2C 

~jlC2D-KIAA1058 
y]iC2F 



GAAGACTCTCAGCCCCTTAAGTGCATTTATGGCAGACCTGGTGGGCCAGTTTTCACAAGA 



EHC2A 
hHC2-80 
S~HC2B 
HC2C 
[\HC2D - KIAA1 058 
CIHC2E 
MHC2F 



AGTTTTACACCATCACC AAAACCCAGAATTTTATGATGAGATTAAA 



AGCGCCTTTGCTGCAGTTTTACACCATCACCAAAACCCAGAATTTTATGATGAGATTAAA 



QHC2A 
QHC2-80 
~~HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



ATAGAGTTGCCCACTCAGCTGCATGAAAAGCAC CACCTGTTGCTCACATTCTTC CATGTC 



ATAGAGTTGCCCACTCAGCTGCATGAAAAGCACCACCTGTTGCTCACATTCTTCCATGTC 



HC2A AGCTGTGACAACTCAAGTAAAGGAAGCACGAAGAAGAGGGATGTCGTTGAAACCCAAGTT 

HC2-80 

HC2B 

HC2C 

HC2D-KIAA1058 AGCTGTGACAACTCAAGTAAAGGAAGCACGAAGAAGAGGGATGTCGTTGAAACCCAAGTT 

HC2E 

HC2F 





HC2A GGCTACTCCTGGCTTCCCCTCCTGAAAGACGGAAGGGTGGTGACAAGCGAGCAGCACATC 

HC2-80 

HC2B .- 

HC2C ^ 

HC2D- KIAA1 058 GGCTACTCCTGGCTTCCCCTCCTGAAAGACGGAAGGGTGGTGACAAGCGAGCAGCACATC 

HC2E 

HC2F 

HC2A CCGGTCTCGGCGAACCTTCCTTCGGGCTATCTTGGCTACCAAGAGCTTGGGATGGGCAGG 

HC2-80 

HC2B 

HC2C 

HC2D-KIAA1 058 CCGGTGTCGGCGAACCTTCCTTCGGGCTATCTTGGCTACCAGGAGCTTGGGATGGGCAGG 

HC2E 

HC2F 



HC2A CATTATGGTCCGGAAATTAAATGGGTAGATGGAGGCAAGCCACTGCTGAAAATTTCCACT 

HC2-80 

HC2B 

; = IiC2C — — — — — — ■ — . — — — __«___- _ _ — — . _ — — — — — — — — — — 

1|C2D-KIAA1058 CATTATGGTCCGGAAATTAAATGGGTAGATGGAGGCAAGCCACTGCTGAAAATTTCCACT 

3|C2E 

yg C2 F 

0&C2A CATCTGGTTTCTACAGTGTATACTCAGGATCAGCATTTACATAATTTTTTCCAGTACTGT 

b@C2-80 

SHC2B 

„HC2C ■■ 

LgC2D - KIAA1 058 CATCTGGTTTCTACAGTGTATACTCAGGATCAGCATTTACATAATTTTTTCCAGTACTGT 

pHC2E 

t=4j C 2F 

CHC2A CAGAAAACCGAATCTGGAGCCCAAGCCTTAGGAAACGAACTTGTAAAGTACCTTAAGAGT 

HKC2-80 

HC2B 

HC2C 

HC2D- KIAA1 058 CAGAAAACCGAATGTGGAGCCCAAGCCTTAGGAAACGAACTTGTAAAGTACCTTAAGAGT 

HC2E 

HC2F 

HC2A CTGCATGCGATGGAAGGCCACGTGATGATCGCCTTCTTGCCCACTATCCTAAACCAGCTG 

HC2-80 

HC2B GCGATGGAAGGCCACGTGATGATCGCCTTCTTGCCCACTATCCTAAACCAGCTG 

HC2C 

HC2D-KIAA1058 CTGCATGCGATGGAAGGCCACGTGATGATCGCCTTCTTGCCCACTATCCTAAACCAGCTG 

HC2 E GCGATGGAAGGCCACGTGATGATCGCCTTCTTGCCCACTATCCTAAACCAGCTG 

HC2F ■ = 





HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



TTCCGAGTCCTCACCAGAGCCACACAGGAAGAAGTCGCGGTTAACGTGAGTCGGGTCATT 

TTCCGAGTCCTCACCAGAGCCACACAGGAAGAAGTCGCGGTTAACGTGACTCGGGTCATT 

TTCCGAGTCCTCACCAGAGCCACACAGGAAGAAGTCGCGGTTAACGTGACTCGGGTCATT 
TTCCGAGTCCTCACCAGAGCCACACAGGAAGAAGTCGCGGTTAACGTGACTCGGGTCATT 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA105 8 

HC2E 

HC2F 



ATTCATGTGGTTGCCCAGTGCCATGAGGAAGGATTGGAGAGCCACTTGAGGTCATATGTT 

ATTCATGTGGTTGCCCAGTGCCATGAGGAAGGATTGGAGAGCCACTTGAGGTCATATGTT 

ATTCATGTGGTTGCCCAGTGCCATGAGGAAGGATTGGAGAGCCACTTGAGGTCATATGTT 
ATTCATGTGGTTGCCCAGTGCCATGAGGAAGGATTGGAGAGCCACTTGAGGTCATATGTT 



HC2A 

HC2-80 

HC2B 

"HC2D-KIAA1058 

3|C2E 

^02 F 



AAGTACGCGTATAAGGCTGAGCCATATGTTGCCTCTGAATACAAGACAGTGCATGAAGAA 

AAGTACGCGTATAAGGCTGAGCCATATGTTGCCTCTGAATACAAGACAGTGCATGAAGAA 

T^AGTACGCGTATAAGGCTGAGCCATATGTTGCCTCTGAATACAAGACAGTGCATGAAGAA 
AAGTACGCGTATAAGGCTGAGCCATATGTTGCCTCTGAATACAAGACAGTGCATGAAGAA 



UiIC2A 
|i@C2-80 
SSC2B 
s HC2C 

LpC2 D - KI AA1 0 5 8 

pHC2E 

HlC2F 



CTGACCAAATCCATGACCACGATTCTCAAGCCTTCTGCCGATTTCCTCACCAGCAAC7^AA 

CTGACCAAATCCATGACCACGATTCTCAAGCCTTCTGCCGATTTCCTCACCAGCAACAAA 

CTGACCAAATCCATGACCACGATTCTCAAGCCTTCTGCCGATTTCCTCACCAGCAACAAA 
CTGACCAAATCCATGACCACGATTCTCAAGCCTTCTGCCGATTTCCTCACCAGCAACAAA 



UHC2A 
OHC2-80 

HC2B 

HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CTACTGAGGTACTCATGGTTTTTCTTTGATGTACTGATCAAATCTATGGCTCAGCATTTG 

CTACTGAGGTACTCATGGTTTTTCTTTGATGTACTGATCAAATCTATGGCTCAGCATTTG 

CTACTGAAGTACTCATGGTTTTTCTTTGATGTACTGATCAAATCTATGGCTCAGCATTTG 
CTACTGAGGTACTCATGGTTTTTCTTTGATGTACTGATCAAATCTATGGCTCAGCATTTG 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



ATAGAGAACTCCAAAGTTAAGTTGCTGCGAAACCAGAGATTTCCTGCATCCTATCATCAT 

ATAGAGAACTCCAAAGTTAAGTTGCTGCGAAACCAGAGATTTCCTGCATCCTATCATCAT 

ATAGAGAACTCCAAAGTTAAGTTGCTGCGAAACCAGAGATTTCCTGCATCCTATCATCAT 
ATAGAGAACTCCAAAGTTAAGTTGCTGCGAAACCAGAGATTTCCTGCATCCTATCATCAT 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



HC2A 
HC2-80 
HC2B 
.^HC2C 

^JiC2 D - KIAA1 058 

ljlC2E 

^HC2F 



3iC2A 
JHC2-80 
JHC2B 
"HC2C 

^HC2D-KIAA10 58 

^HC2E 

^HC2F 



JHC2A 
iHC2-80 
"HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GCAGCGGAAACCGTTGTAAATATGCTGATGCCACACATCACTCAGAAGTTTGGAGATAAT 
GCAGCGGAAAC CGTTGTAAATATGCTGATGC C ACACATCACTC AGAAGTTTGG AGATAAT 

GCAGTGGAAACCGTTGTAAATATGCTGATGCCACACATCACTCAGAAGTTTCGAGATAAT 
GCAGCGGAAACCGTTGTAAATATGCTGATGCCACACATCACTCAGAAGTTTGGAGATAAT 

CCAGAGGCATCTAAGAACGCGAATCATAGCCTTGCTGTCTTCATCAAGAGATGTTTCACC 

CCAGAGGCATCTAAGAACGCGAATCATAGCCTTGCTGTCTTCATCAAGAGATGTTTCACC 

CCAGAGGCATCTAAGAACGCGAATCATAGCCTTGCTGTCTTCATCAAGAGATGTTTCACC 
CCAGAGGCATCTAAGAACGCGAATCATAGCCTTGCTGTCTTCATCAAGAGATGTTTCACC 

TTCATGGACAGGGGCTTTGTCTTCAAGCAGATCAACAACTACATTAGCTGTTTTGCTCCT 

TTCATGGACAGGGGCTTTGTCTTCAAGCAGATCAACAACTACATTAGCTGTTTTGCTCCT 

TTCATGGACAGGGGCTTTGTCTTCAAGCAGATCAACAACTACATTAGCTGTTTTGCTCCT 
TTCATGGACAGGGGCTTTGTCTTCAAGCAGATCAACAACTACATTAGCTGTTTTGCTCCT 

GGAGACCCAAAGACCCTCTTTGAATACAAGTTTGAATTTCTCCGTGTAGTGTGCAACCAT 

GGAGACCCAAAGACCCTCTTTGAATACAAGTTTGAATTTCTCCGTGTAGTGTGCAACCAT 

GGAGACCCAAAGACCCTCTTTGAATACAAGTTTGAATTTCTCCGTGTAGTGTGCAACCAT 
GGAGACCCAAAGACCCTCTTTGAATACAAGTTTGAATTTCTCCGTGTAGTGTGCAACCAT 

GAACATTATATTCCGTTGAACTTACCAATGCCATTTGGAAAAGGCAGGATTCAAAGATAC 

GAACATTATATTCCGTTGAACTTACCAATGCCATTTGGAAAAGGCAGGATTCAAAGATAC 

GAACATTATATTCCGTTGAACTTACCAATGCCATTTGGAAAAGGCAGGATTCAAAGATAC 
GAACATTATATTCCGTTGAACTTACCAATGCCATTTGGAAAAGGCAGGATTCAAAGATAC 

CAAGACCTCCAGCTTGACTACTCATTAACAGATGAGTTGTGCAGAAACCACTTCTTGGTG 

TCCAGCTTGACTACTCATTAACAGATGAGTTCTGCAGAAACCACTTCTTGGTG 

CAAGACCTCCAGCTTGACTACTCATTAACAGATGAGTTCTGCAGAAACCACTTCTTGGTG 

CAAGACCTCCAGCTTGACTACTCATTAACAGATGAGTTCTGCAGAAACCACTTCTTGGTG 
CAAGACCTCCAGCTTGACTACTCATTAACAGATGAGTTCTGCAGAAACCACTTCTTGGTG 



HC2A 
HC2-8 0 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GGACTGTTACTGAGGGAGGTGGGGACAGCCCTCCAGGAGTTCCGGGAGGTCCGTCTGATC 
GGACTGTTACTGAGGGAGGTGGGGACAGCCCTCCAGGAGTTCCGGGAGGTCCGTCTGATC 
GGACTGTTACTGAGGGAGGTGGGGACAGCCCTCCAGGAGTTCCGGGAGGTCCGTCTGATC 

GGACTGTTACTGAGGGAGGTGGGGACAGCCCTCCAGGAGTTCCGGGAGGTCCGTCTGATC 
GGACTGTTACTGAGGGAGGTGGGGACAGCCCTCCAGGAGTTCCGGGAGGTCCGTCTGATC 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GCCATCAGTGTGCTCAAGAACCTGCTGATAAAGCATTCTTTTGATGACAGATATGCTTCA 
GCCATCAGTGTGCTCAAGAACCTGCTGATAAAGCATTCTTTTGATGACAGATATGCTTCA 
GCCATCAGTGTGCTCAAGAACGTGCTGATAAAGCATTCTTTTGATGACAGATATGCTTCA 

GCCATCAGTGTGCTCAAGAACCTGCTGATAAAGCATTCTTTTGATGACAGATATGCTTCA 
GCCATCAGTGTGCTCAAGAACCTGCTGATAAAGCATTCTTTTGATGACAGATATGCTTCA 



HC2A 
HC2-80 
HC2B 
,0|C2C 

^J|C2D-KIAA1058 

3&C2E 

M&C2F 



AGGAGCCATCAGGCAAGGATAGCCACCCTCTACCTGCCTCTGTTTGGTCTGCTGATTGAA 
AGGAGCCATCAGGCAAGGATAGCCACCCTCTACCTGCCTCTGTTTGGTCTGCTGATTGAA 
AGGAGCCATCAGGCAAGGATAGCCACCCTCTACCTGCCTCTGTTTGGTCTGCTGATTGAA 

AGGAGCCATCAGGCAAGGATAGCCACCCTCTACCTGCCTCTGTTTGGTCTGCTGATTGAA 
AGGAGCCATCAGGCAAGGATAGCCACCCTCTACCTGCCTCTGTTTGGTCTGCTGATTGAA 



gJIC2A 
yJIC2-80 

HC2B 
_liC2C 

* HC2D-KIAA1058 

QlC2E 

MkC2F 



AACGTCCAGCGGATCAATGTGAGGGATGTGTCACCCTTCCCTGTGAACGCGGGCATGACC 
AACGTCCAGCGGATCAATGTGAGGGATGTGTCACCCTTCCCTGTGAACGCGGGCATGACC 
AACGTCCAGCGGATCAATGTGAGGGATGTGTCACCCTTC C CTGTGAACGCGGGCATGACC 

AACGTCCAGCGGATCAATGTGAGGGATGTGTCACCCTTCCCTGTGAACGCGGGCATGACT 
AACGTCCAGCGGATCAATGTGAGGGATGTGTCACCCTTCCCTGTGAACGCGGGCATGACC 



r;HC2A 
f-HC2-80 
^HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GTGAAGGATGAATCCCTGGCTCTACCAGCTGTGAATCCGCTGGTGACGCCGCAGAAGGGA 
GTGAAGGATGAATCCCTGGCTCTACCAGCTGTGAATCCGCTGGTGACGCCGCAGAAGGGA 
GTGAAGGATGAATC CCTGGCTCTAC CAGCTGTGAATCCGCTGGTGACGCCGCAGAAGGGA 

GTGTIAGGATGAATCCCTGGCTCTACCAGCTGTGT^ATCCGCTGGTGACGCCGCAGAAGGGA 
GTGAAGGATGAATCCCTGGCTCTACCAGCTGTGAATCCGCTGGTGACGCCGCAGAAGGGA 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



AGCACCCTGGACAACAGCCTGCACAAGGACCTGCTGGGCGCCATCTCCGGCATTGCTTCT 
AGCACCCTGGACAACAGCCTGCACAAGGACCTGCTGGGCGCCATCTCCGGCATTGCTTCT 
AGCACCCTGGACAACAGCGTGCACAAGGACCTGCTGGGCGCCATCTCCGGCATTGCTTCT 

AGCACCCTGGACAACAGCCTGCACAAGGACCTGCTGGGCGCCATCTCCGGCATTGCTTCT 
AGCACCCTGGACAACAGCCTGCACAAGGACCTGCTGGGCGCCATCTCCGGCATTGCTTCT 
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HC2A 
HC2-80 
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HC2E 

HC2F 



HC2A 
HC2-80 
HC2B 
pf!C2C 

.^C2D-KIAA1058 

JIHC2E 

J^HC2F 



S3HC2A 
LUHC2-80 
\!HC2B 
g ~HC2C 

L^HC2D-KIAA1058 

J^HC2E 

~ HG2F 

3 Z 

it: 

□ HC2A 
p HC2-80 
~~ HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CCATATACAACCTCAACTCCAAACATCAACAGTGTGAGAAATGCTGATTCGAGAGGATCT 
CCATATACAACCTCAACTCCAAACATCAACAGTGTGAGAAATGCTGATTCGAGAGGATCT 
CCATATACAACCTCAACTCCAAACATCAACAGTGTGAGAAATGCTGATTCGAGAGGATCT 



CCATATACAACCTCAACTCCAAACATCAACAGTGTGAGAAATGCTGATTCGAGAGGATCT 
CCATATACAACCTCAACTCCAAACATCAACAGTGTGAGAAATGCTGATTCGAGAGGATCT 
_ GCTGATTCGAGAGGATCT 



CTCATAAGGACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 
CTCATAAGCACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 
CTCATAAGCACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 



CTCATAAGCACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 
CTCATAAGCACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 
CTCATAAGCACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 



CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 
CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 
CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 



CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 
CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 
CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 



GACCAGTCTGAGATTAAGAGCCTACTGATGTGTTTCCTCTACATCTTAAAGAGCATGTCT 
GACCAGTCTGAGATTAAGAGCCTACTGATGTGTTTCCTCTACATCTTAAAGAGCATGTCT 
GACCAGTCTGAGATTAAGAGCCTACTGATGTGTTTCCTCTACATCTTAAAGAGCATGTCT 



GACCAGTCTGAGATTAAGAGCCTACTGATGTGTTTCCTCTACATCTTAAAGAGCATGTCT 
GACCAGTCTGAGATTAAGAGGCTACTGATGTGTTTCCTCTACATCTTAAAGAGCATGTCT 
GACCAGTCTGAGATTAAGAGCCTACTGATGTGTTTCCTCTACATCTTAAAGAGCATGTCT 



GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 
GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 
GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 



GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 
GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 
GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 



ACAATATCTGAAGTCTGCCTGCACCAGTTCCAGTACATGGGGAAGCGATACATAGCCAGG 
ACAATATCTGAAGTCTGCCTGCACCAGTTCCAGTACATGGGGAAGCGATACATAGCCAGG 
ACAATATCTGAAGTCTGCCTGCACCAGTTCCAGTACATGGGGAAGCGATACATAGCCAGG 



ACAATATCTGAAGTCTGCCTGCACCAGTTCCAGTACATGGGGAAGCGATACATAGCCAG- 
ACAATATCTGAAGTCTGCCTGCACCAGTTCCAGTACATGGGGAAGCGATACATAGCCAGG 
ACAATATCTGAAGTCTGCCTGCAC CAGTTCCAGTACATGGGGAAGCGATACATAGCCAG - 



HC2A 
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HC2E 
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HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 
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HC2A 

HC2-80 

HC2B 

C 2 C 

^kC2D-KIAA10 58 

^HC2E 

^^HC2F 



gjHC2A 
[yHC2-80 

\J HC2B 
^HC2C 

7 a HC2D-KIAA1058 

tr HC2E 

^3HC2F 



QHC2A 
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HC2A 
HC2-80 
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HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



AACCAGGAGGGGTTGGGACCCATAGTTCATGATCGAAAGTCTCAGACATTGCCTGTTTCC 
AACCAGGAGGGGTTGGGACCCATAGTTCATGATCGAAAGTCTCAGACATTGCCTGTTTCC 
AACCAGGAGGGGTTGGGACCCATAGTTCATGATCGAAAGTCTCAGACATTGCCTGTTTCC 



r AA 

AACCAGGAGGGGTTGGGACCCATAGTTCATGATCGAAAGTCTCAGACATTGCCTGTTTGC 
TGTGA GAAAG ATATCAAGTGT 



CGTAACAGAACAGGAATGATGCATGCCAGATTGCAGCAGCTGGGCAGCCTGGATAACTCT 
CGTAACAGAACAGGAATGATGCATGCCAGATTGCAGCAGCTGGGCAGCCTGGATAACTCT 
CGTAACAGAACAGGAATGATGCATGCCAGATTGCAGCAGCTGGGCAGCCTGGATAACTCT 



CAGGAATGATGCATGCCAGATTGCAGCAGCTGGGCAGCCTGGATAACTCT 

CGTAACAGAACAGGAATGATGCATGCCAGATTGCAGCAGCTGGGCAGCCTGGATAACTCT 
, GCTTGGAA 



CTCACTTTTAACCACAGCTATGGCCACTCGGACGCAGATGTTCTGCACCAGTCATTACTT 
CTCACTTTTAACCACAGCTATGGCCACTCGGACGCAGATGTTCTGCACCAGTCATTACTT 
CTCACTTTTT^ACCACAGCTATGGCCACTCGGACGCAGATGTTCTGCACCAGTCATTACTT 



CTCACTTTTAACCACAGCTATGGCCACTCGGACGCAGATGTTCTGCACCAGTCATTACTT 
CTCACTTTTAACCACAGCTATGGCCACTCGGACGCAGATGTTCTGCACCAGTCATTACTT 
-TTTCTGTAGACAATGGCTATGGCCACTCGGACGCAGATGTTCTGCACCAGTCATTACTT 



GAAGCCAACATTGCTACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 
GAAGCCAACATTGCTACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 
GAAGCCAACATTGCTACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 



GAAGCCAACATTGCTACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 
GAAGCCAACATTGCTACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 
GAAGCCAACATTGCTACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 



TTGGCGTTTAAGAACCAGCTCCTGGCCGACCATGGACATAATCCTCTCATGAAAAAAGTT 
TTGGCGTTTAAGAACCAGCTCCTGGCCGACCATGGACATAATCCTCTCATGAAAAAAGTT 
TTGGCGTTTAAG CTCCTGGCCGACCATGGACATAATCCTCTCATGAAAAAAGTT 



TTGGCGTTTAAGAACCAGCTCCTGGCCGACCATGGACATAATCCTCTCATGAAAAAAGTT 
TTGGCGTTTAAGAACCAGCTCCTGGCCGACCATGGACATAATCCTCTCATGAAAAAAGTT 
TTGGCGTTTAAGAACCAGCTCCTGGCCGACCATGGACATAATCCTCTCATGAAAAAAAAA 



TTTGATGTCTACCTGTGTTTTCTTCAAAAACATCAGTCTGAAACGGCTTTAAAAAATGTC 
TTTGATGTCTACCTGTGTTTTCTTCAAAAACATCAGTCTGAAACGGCTTTAAAAAATGTC 
TTTGATGTCTACCTGTGTTTTCTTCAAAAACATCAGTCTGAAACGGCTTTAAAAAATGTC 



TTTGATGTCTACCTGTGTTTTCTTCAAAAACATCAGTCTGAAACGGCTTTAAAAAATGTC 
TTTGATGTCTACCTGTGTTTTCTTCAAAAACATCAGTCTGAAACGGCTTTAAAAAATGTC 
A 





HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



TTCACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAACATTCTATGAAGGGAGAGCG 
TTCACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAACATTCTATGAAGGGAGAGCG 
TTCACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAACATTCTATGAAGGGAGAGCG 

TTCACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAACATTCTATGAAGGGAGAGCG. 
TTCACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAACATTCTATGAAGGGAGAGCG 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GACATGTGTGCGGCTCTGTGTTACGAGATTCTCAAGTGCTGTAACTCCAAGCTGAGCTCC 
GACATGTGTGCGGCTCTGTGTTACGAGATTCTCAAGTGCTGTAACTCCAAGCTGAGCTCC 
GACATGTGTGCGGCTCTGTGTTACGAGATTCTCAAGTGCTGTAACTCCAAGCTGAGCTGC 

GACATGTGTGCGGCTCTGTGTTACGAGATTCTCAAGTGCTGTAACTCCAAGCTGAGCTCC 
GACATGTGTGCGGCTCTGTGTTACGAGATTCTCAAGTGCTGTAACTCCAAGCTGAGCTCC 



HC2A 

HC2-80 

HC2B 

□ HC2C 

SHC2D-KIAA1058 

SHC2E 

t~HC2F 



ATCAGGACGGAGGCCTCCCAGCTGCTCTACTTCCTGATGAGGAACAACTTTGATTACACT 
ATCAGGACGGAGGCCTCCCAGCTGCTCTACTTCCTGATGAGGAACAACTTTGATTACACT 
ATCAGGACGGAGGCCTCCCAGCTGCTCTACTTCCTGATGAGGAACAACTTTGATTACACT 

ATCAGGACGGAGGCCTCCCAGCTGCTCTACTTCCTGATGAGGAACAACTTTGATTACACT 
ATCAGGACGGAGGCCTCCCAGCTGCTCTACTTCCTGATGAGGAACAACTTTGATTACACT 



HC2A 
W HC2-80 
\JHC2B 
3 HC2C 

HC2D-KIAA1058 
^ HC2E 
~r S HC2F 



GGAAAGAAGTCCTTTGTCCGGACACATTTGCAAGTCATCATATCTGTCAGCCAGCTGATA 
GGAAAGAAGTCCTTTGTCCGGACACATTTGCAAGTCATCATATCTGTCAGCCAGCTGATA 
GGAAAGAAGTCCTTTGTCCGGACACATTTGCAAGTCATCATATCTGTCAGCCAGCTGATA 

GGAAAGAAGTCCTTTGTCCGGACACATTTGCAAGTCATCATATCTGTCAGCCAGCTGATA 
GGAAAGAAGTCCTTTGTCCGGACACATTTGCAAGTCATCATATCTGTCAGCCAGCTGATA 



U HC2A 
O HC2-80 

HC2B 

HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GCAGACGTTGTTGGCATTGGGGAAACCAGATTCCAGCAGTCCCTGTCCATCATCAACAAC 
GCAGACGTTGTTGGCATTGGGGAAACCAGATTCCAGCAGTCCCTGTCCATCATCAACAAC 
GCAGACGTTGTTGGCATTGGGGAAACCAGATTCCAGCAGTCCCTGTCCATCATCAACAAC 

GCAGACGTTGTTGGCATTGGGGGAACCAGATTCCAGCAGTCCCTGTCCATCATCAACAAC 
GCAGACGTTGTTGGCATTGGGGAAACCAGATTCCAGCAGTCCCTGTCCATCATCAACAAC 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



TGTGCCAACAGTGACCGGCTTATTAAGCACACCAGCTTCTCCTCTGATGTGAAGGACTTA 
TGTGCCAACAGTGACCGGCTTATTAAGCACACCAGCTTCTCCTCTGATGTGAAGGACTTA 
TGTGCCAACAGTGACCGGCTTATTAAGCACACCAGCTTCTCCTCTGATGTGAAGGACTTA 

TGTGCCAACAGTGACCGGCTTATTAAGCACACCAGCTTCTCCTCTGATGTGAAGGACTTA 
TGTGCCAACAGTGACCGGCTTATTAAGCACACCAGCTTCTCCTCTGATGTGAAGGACTTA 





HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



ACCAAAAGGATACGCACGGTGCTAATGGCCACCGCCCAGATGAAGGAGCATGAGAACGAC 
ACCAAAAGGATACGCACGGTGCTAATGGCCACCGCCCAGATGAAGGAGCATGAGAACGAC 
ACCAAAAGGATACGCACGGTGCTAATGGCCACCGCCCAGATGAAGGAGCATGAGAACGAC 

ACCAAAAGGATACGCACGGTGCTAATGGCCACCGCCCAGATGAAGGAGCATGAGAACGAC 
ACC AAAAGGATACGC ACGGTGCT AATGGCC AC CGCC C AG ATGAAGGAGC ATG AGAACGAC 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CCAGAGATGCTGGTGGACCTCCAGTACAGCCTGGCCAAATCCTATGCCAGCACGCCCGAG 
CCAGAGATGCTGGTGGACCTCCAGTAC AGC CTGGCCAAATCCTATGCCAGCACGC C CGAG 
CCAGAGATGCTGGTGGACCTCCAGTACAGCCTGGCCAAATCCTATGCCAGCACGCCCGAG 

CCAGAGATGCTGGTGGACCTCCAGTACAGCCTGGCCAAATCCTATGCCAGCACGCCCGAG 
CCAGAGATGCTGGTGGACCTCCAGTACAGCCTGGCCAAATCCTATGCCAGCACGCCCGAG 



HC2A 
HC2-80 
HC2B 
^HC2C 

™ HC2D-KIAA1058 
V HC2E 
LM HC2F 



CTCAGGAAGACGTGGCTCGACAGCATGGCCAGGATCCATGTCAAAAATGGCGATCTCTCA 
CTCAGGAAGACGTGGCTCGACAGCATGGCCAGGATCCATGTC7^AAAATGGCGATCTCTCA 
CTCAGGAAGACGTGGCTCGACAGCATGGCCAGGATCCATGTCAAAAATGGCGATCTCTCA 

CTCAGGAAGACGTGGCTCGACAGCATGGCCAGGATCCATGTCAAAAATGGCGATCTCTCA 
CTCAGGAAGACGTGGCTCGACAGCATGGCCAGGATCCATGTCAAAAATGGCGATCTCTCA 



II HC2A 
V] HC2-80 
fl HC2B 
^ HC2C 

HC2D-KIAA1058 

Z~ HC2E 

□ HC2F 
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GAGGCAGCAATGTGCTATGTCCACGTAACAGCCCTAGTGGCAGAATATCTCACACGGAAA 
GAGGCAGCAATGTGCTATGTCCACGTAACAGCCCTAGTGGCAGAATATCTCACACGGAAA 
GAGGCAGCAATGTGCTATGTCCACGTAACAGCCCTAGTGGCAGAATATCTCACACGGAAA 

GAGGCAGCAATGTGCTATGTCCACGTAACAGCCCTAGTGGCAGAATATCTCACACGGAAA 
GAGGCAGCAATGTGCTATGTCCACGTAACAGCCCTAGTGGCAGAATATCTCACACGGAAA 

G 

G 

G 

GAAGCAGTCCAGTGGGAGCCGCCCCTTCTCCCCCACAGCCATAGCGCCTGCCTGAGGAGG 
G 



HC2A GCGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

HC2 - 80 GCGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

HC2B GCGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

HC2 C GTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

HC2D- KIAA1 058 AGCCGGGGAGGCGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

HC2E GCGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

HC2F 
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HC2D-KIAA1058 
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HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 
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HC2A 
HC2-80 
HC2B 
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~~ HC2D-KIAA1058 

HC2E 
¥-? HC2F 



yy hc2a 

yj HC2-80 
SJ HC2B 
a " HC2C 
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U. HC2E 
^ HC2F 



U HC2A 
Q HC2-80 

HC2B 

HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 
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HC2D-KIAA1058 

HC2E 
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3 4 

G ACG AGGAGGCCTC C ATGATGGAAGACGTGGGGATGC AGGATGTC C ATTTC AACG AGGAT 
GACGAGGAGGCCTC CATGATGGAAGACGTGGGGATGCAGGATGTC CATTTCAACGAGGAT 
GACGAGGAGGCCTCCATGATGGAAGACGTGGGGATGCAGGATGTCCATTTCAACGAGGAT 
GACGAGGAGGCCTCCATGATGGAAGACGTGGGGATGCAGGATGTCCATTTCAACGAGGAT 
GACGAGGAGGCCTCCATGATGGAAGACGTGGGGATGCAGGATGTCCATTTCAACGAGGAT 
GACGAGGAGGCCTCCATGATGGAAGACGTGGGGA 



GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGACTCTGGAAAGCCGAGCGCTACGAG 
GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGACTCTGGAAAGCCGAGCGCTACGAG 
GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGACTCTGGAAAGCCGAGCGCTACGAG 
GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGACTCTGGAAAGCCGAGCGCTACGAG 
GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGACTCTGGAAAGCCGAGCGCTACGAG 
AAGCCGAGCGCTACGAG 



CTCATCGCCGACATCTACAAACTTATCATCCCCATTTATGAGAAGCGGAGGGATTT 

CTCATCGCCGACATCTACAAACTTATCATCCCCATTTATGAGAAGCGGAGGGATTT 

CTCATCGCCGACATCTACAAACTTATCATCCCCATTTATGAGAAGCGGAGGGATTTTGAG 
CTCATCGCCGACATCTACAAACTTATCATCCCCATTTATGAGAAGCGGAGGGATTTTGAG 
CTCATTGCCGACATCTACAAACTTATCATCCCCATTTATGAGAAGCGGAGGGATTTTGAG 
CTCATCGCCGACATCTACAAACTTATCATCCCCATTTATGAGAAGCGGAGGGATTTTGAG 



AGGCTGGCCCATCTGTATGACACGCTGCACCGGGCCTACAGCAAAGTGACCGAGGTCATG 
AGGCTGGCCCATCTGTATGACACGCTGCACCGGGCCTACAGCAAAGTGACCGAGGTCATG 
AGGCTGGCCCATCTGTATGACACGCTGCACCGGGCCTACAGCAAAGTGACCGAGGTCATG 
AGGCTGGCCCATCTGTATGACACGCTGCACCGGGCCTACAGCAAAGTGACCGAGGTCATG 



CACTCGGGCCGCAGGCTTCTGGGGACCTACTTCCGGGTAGCCTTCTTCGGGCAGG 

CACTCGGGCCGCAGGCTTCTGGGGACCTACTTCCGGGTAGCCTTCTTCGGGCAGG 

CACTCGGGCCGCAGGCTTCTGGGGACCTACTTCCGGGTAGCCTTCTTCGGGCAGGCAGCG 
CACTCGGGCCGCAGGCTTCTGGGGACCTACTTCCGGGTAGCCTTCTTCGGGCAGG- 



CTTTGAAGATGAAGATGGA 

CTTTGAAGATGAAGATGGA 

GATTCTTTGAAGATGAAGATGGA 

GATTCTTTGAAGATGAAGATGGA 

CAATACCAGTTTACAGACAGTGAAACAGATGTGGAGGGATTCTTTGAAGATGAAGATGGA 
GATTCTTTGAAGATGAAGATGGA 
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AAGGAGTATATTTAC AAGGAACC CAAACTCAC AC CGCTGTCGGAAATTTCTCAGAGACTC 
AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATTTCTCAGAGACTC 
AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATTTCTCAGAGACTC 
AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATTTCTCAGAGACTC 
AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATTTCTCAGAGACTC 
AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATTTCTCAGAGACTC 



CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTCAAAATGATACAGGATTCTGGC 
•CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTCAAAATGATACAGGATTCTGGC 
CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTCAAAATGATACAGGATTCTGGC 
CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTCAAAATGACACAGGATTCTGGC 
CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTCAAAATGATACAGGATTCTGGC 
CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTCAAAATGATACAGGATTCTGGC 



AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCATACATCCAGGTGACTCACGTCATC 
AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCATACATCCAGGTGACTCACGTCATC 
AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCATACATCCAGGTGACTCACGTCATC 
AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCATACATCCAGGTGACTCACGTCATC 
AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCCTACATCCAGGTGACTCACGTCATC 
AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCATACATCCAGGTGACTCACGTCATC 



CCCTTCTTTGACGAAAAAGAGTTGCAAGAAAGGAAAACAGAGTTTGAGAGATCCCACAAC 
CCCTTCTTTGACGAAAAAGAGTTGCAAGAAAGGAAAACAGAGTTTGAGAGATCCCACAAC 
CCCTTCTTTGACGAAAAAGAGTTGCAAGAAAGGAAAACAGAGTTTGAGAGATCCCACAAC 
CCCTTCTTTGACGAAAAAGAGTTGCAAGJVAAGGAAAACAGAGTTTGAGAGATCCCACAAC 
CCCTTCTTTGACGAAAAAGAGTTGCAAGAAAGGAAAACAGAGTTTGAGAGATCCCACAAC 
CCCTTCTTTGACGAAAAAGAGTTGCAAGAAAGGAAAACAGAGTTTGAGAGATCCCACAAC 



ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 
ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 
ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 
ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 
ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 
ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 



GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATACACTGCTTCCCTTATGTGAAG 
GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATACACTGCTTCCCTTATGTGAAG 
GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATACACTGCTTCCCTTATGTGAAG 
GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATACACTGCTTCCCTTATGTGAAG 
GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATACACTGCTTCCCTTATGTGAAG 
GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATACACTGCTTCCCTTATGTGAAG 



HC2F 




HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 
AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 
AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 
AAGCGCATCCCTTTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGT- - CCATT 
AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 
AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 



HC2A 
HC2-80 
HC2B 
JJC2C 

M*C2D - KIAA1 058 

#C2E 

0&C2F 



ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 
ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 
ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 
ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 
ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 
ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 



g£C2A 

rs c2_8 ° 

:ffiC2B 
TkC2C 

s HC2D-KIAA1058 

H*C2E 
CHC2F 



CTAGCATATGCGCGAGCTTTCTTAGATGATAC7VAACACAAAGCGATATCCTGACAATAAA 
CTAGCATATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATAAA 
CTAGCATATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATAAA 
CTAGCATATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATAAA 
CTAGCATATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATAAA 
CTAGCATATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATAAA 



iHC2A 
rHC2-80 
HiC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 
GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 
GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 
GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 
GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 
GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 
GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 
GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 
GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 
GTAAACG7VACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 
GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 



fl6|. W^dwif^ 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



TACAGGGAAATGGCGAAGGAGCTTTCTGAAATCATGCATGAGCAG ATCTGCC 

TACAGGGAAATGGCGAAGGAGCTTTCTGAAATCATGCATGAGC AG ATCTGCC 

TACAGGGAAATGGCGAAGGAGCTTTCTGAAATCATGCATGAGCAG - - ATCTGCC 

TACAGGGAAATGGCGAAGGAGCTTTCTGAAATCATGCATGAGC AG ATCTGCC 

TACAGGGAAATGGCGAAGGAGCTTTCTGAAATCATGCATGAGCAGCTGGGATGATCTGCC 
TACAGGGAAATGGCGAAGGAGCTTTCTGAAATCATGCATGAGCAG- ATCTGCC 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 
CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 
CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 
CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 
CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 
CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 



HC2A 
HC2-80 
HC2B 
r?iC2C 

J5IC2D-KIAA1058 
?LHC2F 



GGACTCCAACAAGCACAATGGTTCACGGGATGACCAGCTCGTCTTCGGTCGTGTGATTAC 
GGACTCCAACAAGCACAATGGTTCACGGGATGACCAGCTCGTCTTCGGTCGTGTGATTAC 

GGACTCCAACAAGCACAATGGTTCACGGGATGACCAGCTCGTCTTCGGTCGTGTGA 

GGACTCCAACAAGCACAATGGTTCACGGGATGACCAGCTCGTCTTCGGTCGTGTGA 

GGACTCCAACAAGCACAATGGTTCACGGGATGACCAGCTCGTCTTCGGTCGTGTGATTAC 
GGACTCCAACAAGCACAATGGTTCACGGGATGACCAGCTCGTCTTCGGTCGTGTGA 



HJHC2A 
UHC2-80 
SJHC2B 
g ~HC2C 

WiiC2 D - KIAA1 058 

JZHC2E 
^J HC2F 



ATCTCATGGCCCGTGTGTGGGGACTTGCTTTGTCATTTGCAAACTCAGGATGCTTTCCAA 
ATCTCATGGCCCGTGTGTGGGGACTTGCTTTGTCATTTGCAAACTCAGGATGCTTTCCAA 



ATCTCATGGCCCGTGTGTGGGGACTTGCTTTGTCATTTGCAAACTCAGGATGCTTTCCAA 



JHC2A 
JHC2-80 

HC2B 

HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



AGCCAATCACTGGGGAGACCGAGCACAGGGAGGACCAAGGGGAAGGGGAGAGAAAGGAAA 
AGCCAATCACTGGGGAGACCGAGCACAGGGAGGACCAAGGGGAAGGGGAGAGAAAGGAAA 



AGCCAATCACTGGGGAGACCGAGCACAGGGAGGACCA - GGGGAAGGGGAGAGAAAGGAAA 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



TAAAGAACAACGTTATTTCTTAACAGACTTTCTATAGGAGTTGTAAGAAGGTGCACATAT 
TAAAGAACAACGTTATTTCTTAACAGACTTTCTATAGGAGTTGTAAGAAGGTGCACATAT 



TAAAGAACAACGTTATTTCTTAACAGACTTTCTATAGGAGTTGTAAGAAGGTGCACATAT 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



TTTTTTAAATCTCACTGGCAATATTCAAAGTTTTCATTGTGTCTTAACAAAGGTGTGGTA 
TTTTTTAAATCTCACTGGCAATATTCAAAGTTTTCATTGTGTCTTAACAAAGGTGTGGTA 



TTTTTTAAATCTCACTGGCAATATTCAAAGTTTTCATTGTGTCTTAACAAAGGTGTGGTA 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GACACTCTTGAGCTGGACTTAGATTTTATTCTTCCTTGCAGAGTAGTGTTAGAATAGATG 
GACACTCTTGAGCTGGACTTAGATTTTATTCTTCCTTGCAGAGTAGTGTTAGAATAGATG 



GACACTCTTGAGCTGGACTTAGATTTTATTCTTCCTTGCAGAGTAGTGTTAGAATAGATG 



HC2A 
HC2-80 
HC2B 
^HC2C 

"SiC 2D- KIAA1 058 

.^HC2Hf 

rJiC2F 



GCCTACAGAAAAAAAAGGTTCTGGGATCTACATGGCAGGGAGGGCTGCACTGACATTGAT 
GCCTACAGAAAAAAAAGGTTCTGGGATCTACATGGCAGGGAGGGCTGCACTGACATTGAT 



GCCTACAGAAAAAAAAGGTTCTGGGATCTACATGGCAGGGAGGGCTGCACTGACATTGAT 



0-HC2A 
UMC2 - 8 0 
SJHC2B 
_ HC2C 

L.HC2 D - KIAA1 05 8 

^HC2E 

HhC2F 



GCCTGGGGGACCTTTTGCCTCGACTCGTGCCGGAAATCTGATCGTAATCAGGGTACAGAA 
GCCTGGGGGACCTTTTGCCTCGACTCGTGCCGGAAATCTGATCGTAATCAGGGTACAGAA 



GCCTGGGGGACCTTTTGCCTCGAGGCTGAGCTGGAAAATCTTGAAAATATTTTTT T 



QHC2A 
GHC2-80 

HC2B 

HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CTTACTAGTTTTGTCTAGGAGTATGTTGTATGACTAGGATTTGTGCTATTATCTCATTCA 
CTTACTAGTTTTGTCTAGGAGTATGTTGTATGACTAGGATTTGTGCTATTATCTCATTCA 



TTTCCTGTGGCACATTCAGGTTGAATACAAGAACTATTTTTGTGACTAGTTTTTGATGAC 



HC2A ACAACATAGAGCAAGAATAGTGAGCTAACTGAGCTAGACACTCAATTAATCCGCTACTGG 

HC2 - 8 0 ACAACATAGAGCAAGAATAGTGAGCTAACTGAGCTAGACACTCAATTAATCCGCTACTGG 

HC2B 

HC2C 

HC2D - KIAA1 058 CTAAGGGAACTGACCATTGTAATTTTTGTACCAGTGAACCAGGAGATTTAGTGCTTTTAT 

HC2E 

HC2F 



HC2A CTTCAAGTCAGAACTTTGTCATTAATCATCGACTCCGGGACGGTCATATATGTATTACAT 

HC2 - 80 CTTCAAGTCAGAACTTTGTCATTAATCATCGACTCCGGGACGGTCATATATGTATTACAT 

HC2B 

HC2C 

HC2D- KIAA10 5 8 ATTCATTTCCTTGCATTTAAGAAAATATGAAAGCTTAAGGAATTATGTGAGCTTAAAACT 

HC2E 

HC2F 

HC2A TTCTACATTTTTAATACTCACATGGGCTTATGCATTAAGTTTAATTGTGATAAATTTGTG 

HC2 - 8 0 TTCTACATTTTTAATACTCACATGGGCTTATGCATTAAGTTTAATTGTGATAAATTTGTG 

HC2B 

HC2C 

HC2D- KIAA1 058 AGTCAAGCAGTTTAGAACCAAAGGCCTATATTAATAACCGCAACTATGCTGAAAAGTACA 

HC2E 

HC2F 



HC2A 

HC2-80 

HC2B 

JHC2D-KIAA10 58 

mC2E 

mC2F 



rgC2A 
J%C2-80 
% *IC2B 
s HC2C 

MHC2 D - KI AA1 0 5 8 

CPC2E 

L_=HC2F 



!zHC2A 
^02-80 

HC2B 

HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CTGGTCCAGTATATGCAATACACTTTAATGGTTTATTCTTGTCATAAAAATGTGCAATAT 
CTGGTCCAGTATATGCAATACACTTTAATGGTTTATTCTTGTCATAAAAATGTGCAATAT 

AAGTAGTACAGTATATTGTTATGTACATATCATTGTTAATACAGTCCTGGCATTCTGTAC 

GGAGATGTATACAAGTCTTTACT 

GGAGATGTATACAAGTCTTTACT 

ATATATGTATTACATTTCTACATTTTTAATACTCACATGGGCTTATGCATTAAGTTTAAT 
TGTGATAAATTTGTGCTGTTCCAGTATATGCAATACACTTTAATGTTTTATTCTTGTACA 



HC2A 

HC2-80 

HC2B 

HC2C 

HC2D- KIAA1 058 TAAAAATGTGCAATATGGAGATGTATACAGTCTTTACTATATTAGGTTTATAAACAGTTT 

HC2E 

HC2F 



HC2A 

HC2-80 

HC2B 

HC2C 

HC 2 D - KIAA1 058 TAAGAATTTCATCCTTTTGC C AAAATGGTGGAGTATGTAATTGGT AAATCATAAAT C CTG 

HC2E 

HC2F 



HC2A .-- 

HC2-80 

HC2B 

HC2C 

HC2D - KIAA1 058 TGGTGAATGGTGGTGTACTTTAAAGCTGTCACCATGTTATATTTTCTTTTAAGACATTAA 

HC2E 

HC2F 



HC2A 

HC2-80 

HC2B 

:hc2C 

7HC2 D - KIAA1 058 TTTAGTAATTTTATATTTGGGAAAATAAAGGTTTTTAATTTTATTTAACTGGAATCACTG 

iHC2E 

=iHC2F 

J HC2A 

mC2-80 

JHC2B 

HC2C 

HC2D- KIAA10 5 8 CCCTGCTGTAATTAAACATTCTGTACCACATCTGTATTAAAAAGACATTGCTGACC 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A- 8 0 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



: =^HC2 A 

JHC2A-80 

DTHC2B 

gHC2C 

\HC2D 

^ C2E 
rHC2F 



s HC2A 
'rtIC2A-80 
CBC2B 
yHC2C 

iHC2E 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



ASGNLDKNARFSAIYRQDSNKLSNDDMLKLLADFRKPEKMAKLPVILGNLDITIDNVSSD 



FPNYVNSS YI PTKQFETCSKT FIT FEVEE FVPC I PKHTQP YT I YTNHL YVYPKYLKYDSQ 



VLHHHQNPE FYDE I K 



KS FAKARN I AI C I E FKDS DEEDS QPLKC I YGRP GGPVFTRS AFAAVLHHHQNPE FYDE I K 



IELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDVVETQVGYSWLPLLKDGRWTSEQHI 



IELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDWETQVGYSWLPLLKDGRWTSEQHI 



PVSANLPS G YLG YQE LGMGRH YGPE I KWVDGGKPLLK I S THLVS TVY T QDQHLHNFFQ YC 



PVSANLPSGYLGYQELGMGRHYGPEIKWVDGGKPLLKISTHLVSTVYTQDQHLHNFFQYC 



QKTESGAQALGNELVKYLKS LHAMEGHVMIAFLPT ILNQLFRVLTRATQEEVAVNVTRVI 



AMEGHVMIAFLPT ILNQLFRVLTRATQEEVAVNVTRVI 



QKTESGAQALGNELVKYLKS LHAMEGHVMIAFLPT ILNQLFRVLTRATQEEVAVNVTRVI 
AMEGHVMIAFLPT ILNQLFRVLTRATQEEVAVNVTRVI 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 
HC2A-80 
EC2B 
,JC2C 

Q*C2D 

SC2E 
3SC2F 



HfC2A 
1IC2A-80 
-HC2B 
?=HC2C 

gc2D 

L&C2E 
=*HC2F 



Mlc2A 
HC2A-80 
HC2B 
HC2C 
HC2D 
HC2E 
HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



IHWAQCHEEGLESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSNK 



IHWAQCHEEGLESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSNK 



IHWAQCHEEGLESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSNK 
IHVVAQCHEEGLESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSNK 



LLRYSW FFFDVL IKSMAQHL IENSKVKLLRNQRFPAS YHHAAETVVNMLMPH I TQKFGDN 



LLRYSWFFFDVLIKSMAQHLIENSKVKLLRNQRFPASYHHAAETVVXOMLMPHITQKFGDN 



LLKYSWFFFDVLIKSMAQHLIENSKVKLLRNQRFPASYHHAVETVVNMLMPHITQKFRDN 
LLRYSWFFFDVLIKSMAQHLIENSKVKLLRNQRFPASYHHAAETVVNMLMPHITQKFGDN 



PEAS KNANHS LAVFI KRC FT FMDRG FVFKQ I NNY ISC FAPGDPKT L FE YK FE FLR WCNH 



PEAS KNANHS I^VFIKRCFT FMDRG FVFKQINNYISCFAPGDPKTLFEYKFEFLRVVCNH 



PEAS KNANHS LAVFIKRCFTFMDRGFVFKQINNYISCFAPGDPKTLFEYKFEFLRWCNH 
PEASKNANHSLAVFIKRCFTFMDRGFVFKQINNYISCFAPGDPKTLFEYKFEFLRVVCNH 



EHY I PLNLPMPFGKGRI QRYQDLQLDYS LTDE FCRNHFLVGLLLREVGTALQE FREVRL I 

QLDYSLTDE FCRNHFLVGLLLREVGTALQE FREVRL I 

EHY I PLNLPMPFGKGRI QRYQDLQLDYS LTDE FCRNHFLVGLLLREVGTALQE FREVRL I 



EHY I PLNLPMPFGKGRI QRYQDLQLDYS LTDE FCRNHFLVGLLLREVGTALQE FREVRL I 
EHY I PLNL PMP FGKGR I QRYQDLQLDYS L T DE FCRNH FLVGLLLREVGTALQE FREVRL I 



AISVLKNLLIKHSFDDRYASRSHQARIATLYLPLFGLLIENVQRINVRDVSPFPVNAGMT 
AI S VLKNLL IKHS FDDRYASRSHQARI ATLYLPL FGLL I ENVQRINVRDVS P FPVNAGMT 
AISVLKNLLIKHSFDDRYASRSHQARIATLYLPLFGLLIENVQRINVRDVSPFPVNAGMT 



AISVLKNLLIKHS FDDRYASRSHQARIATLYLPLFGLLIENVQRINVRDVSPFPVNAGMT 
AI S VLKNLL I KHS FDDRYAS RS HQARI AT L YL P L FGLL I ENVQR I NVRDVS P FPVNAGMT 



VKDESLALPAVNPLVTPQKGSTLDNSLHKDLLGAISGIASPYTTSTPNINSVRNADSRGS 
VKDESLALPAVNPLVTPQKGSTLDNSLHKDLLGAISGIASPYTTSTPNINSVRNADSRGS 
VKDES LAL PAVNPLVT PQKGS TLDNS LHKDL LGAI SG I AS P YTTST PN I NS VRNADS RGS 



VKDES LAL P AVNP LVT P QKGS T LDNS LHKDL LGAI S G IAS P Y T T S T PN I NS VRNADS RGS 
VKDESLALPAVNPLVTPQKGSTLDNSLHKDLLGAISGIASPYTTSTPNINSVRNADSRGS 
ADSRGS 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

yb2D 

m:2K 

8§C2F 



fl|C2A 
jj§C2A-80 
~fiC2B 
HC2C 
S HC2D 
NtC2E 
IHC2F 



mC2A 

fl|C2A-80 

4IC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



L I S TDSGNS LPERNSEKSNSLDKHQQSS TLGNSWRCDKLDQSE IKSLLMCFLY I LKSMS 
LISTDSGNSLPERNSEKSNSLDKHQQSSTLGNSWRCDKLDQSEIKSLLMCFLYILKSMS 
LIS TDSGNSLPERNSEKSNS LDKHQQSS TLGNSWRCDKLDQSE I KSLLMCFLY I LKSMS 



LISTDSGNSLPERNSEKSNSLDKHQQSSTLGNSWRCDKLDQSEIKSLLMCFLYILKSMS 
LIS TDS GNS LPERNS EKSNS LDKHQQSS TLGNSWRCDKLDQSE I KS L LMC FLY I LKSMS 
L I S TDS GNS L PERNS EKS NS LDKHQQS S TLGNS WRCDKL DQS E I KS LLMC FLY I LKSMS 



DDALFTYWNKASTSELMDFFTISEVCLHQFQYMGKRYIARNQEGLGPIVHDRKSQTLPVS 
DDAL FT YWNKAS TS E LMDF FT I S E VCLHQ FQ YMGKRY I ARNQE GLG P I VHDRKS Q T L P VS 
DDALFTYWNKASTSELMDFFTISEVCLHQFQYMGKRYIARNQEGLGPIVHDRKSQTLPVS 



DDALFTYWNKASTSELMDFFTISEVCLHQFQYMGKRYIAR 

DDALFTYWNKASTSEmDFFTISEVCLHQFQYMGKRYIARNQEGLGPIVHDRKSQTLPVS 
DDAL FT YWNKAS TS E LMDFFT I SE VCLHQFQYMGKRY IAS VR — KISSVLGIS 



RNRTGMMHARLQQLGSLDNSLTFNHSYGHSDADVLHQSLLEANIATEVCLTALDTLSLFT 
RNRT GMMHARLQ QLGS LDNS L T FNHS Y GHS DADVLHQS L LEAN I ATE VCL T ALDT LS L FT 
RNRTGMMHARLQQLGSLDNSLTFNHSYGHSDADVLHQSLLEANIATEVCLTALDTLSLFT 



TGMMHARLQQLGSLDNSLTFNHSYGHSDADVLHQSLLEANIATEVCLTALDTLSLFT 

RNRTGMMHARLQQLGSLDNSLTFNHSYGHSDADVLHQSLLEANIATEVCLTALDTLSLFT 
V D-NG YGHSDADVLHQSLLEANIATEVCLTALDTLSLFT 



LAFKNQLLADHGHNPLMKKVFDVYLCFLQKHQSETALKNVFTALRSLI^ 
LAFKNQLIJVDHGHNPLMKPCVFDVYLC FLQKH QS E TALKNVFTALRS L I YKFPS T FYE GRA 
LAFK — LLADHGHNP LMKKVFDVYLC FLQKHQS E TALKNVFT ALRS L I YKFPS T FYE GRA 



LAFKNQLLADHGHNPLMKKVFDVYLC FLQKHQSE TALKNVFTALRS L I YK FPS T FYE GRA 
LAFKNQLLADHGHNPLMKKVFDVYLCFLQKHQSETALKNVFTALRSLIYKFPS^ 
LAFKNQLLADHGHNPLMKKK 



DMCAALCYEILKCCNSKLSS IRTEASQLLYFLMRNNFDYTGKKS FVRTHLQVI ISVSQLI 
DMCAALCYE ILKCCNSKLSS IRTEASQLLYFLMRNNFDYTGKKS FVRTHLQVI ISVSQLI 
DMCAALCYEILKCCNSKLSSIRTEASQLLYFIJ^RNNFDYTGKKSFVRTHLQVI ISVSQLI 



DMCAALCYE I LKCCNSKLSS IRTEAS QLLYFLMRNNFDYTGKKS FVRTHLQVI I SVSQLI 
DMCAALCYEILKCCNSKLSS IRTEASQLLYFLMRNNFDYTGKKS FVRTHLQVI ISVSQLI 



ADWG I GETRFQQS LS 1 1 NNCANS DRL I KHTS FSS DVKDLTKR I RT VLMATAQMKEHEND 
AD WG I GE TR FQQS LS 1 1 NNCANS DRL I KH TS FS S DVKDLTKR I RTVLMATAQMKEHEND 
ADWG I GE TRFQQS LS 1 1 NNCANS DRL I KHTS FSS DVKDLTKRI RT VLMATAQMKEHEND 



ADWG I GGTRFQQS LS I INNCANS DRL I KHTS FSS DVKDLTKRI RTVLMATAQMKEHEND 
ADWG I GETRFQQS LS I INNCANS DRL I KHTS FS S DVKDLTKRI RTVLMATAQMKEHEND 



HC2A PEMLVDLQYSIJ^KSYASTPELRKTWLDSMARIHVKNGDLSEAAMCYVHVTALVAEYLTRK 

HC2A-80 PEMLVDLQYSLAKSYASTPELRKTWLDSMARIHVKNGDLSEAAMCYVHVT^ 

HC2B PEMLVDLQ YS LAKS YAS T PELRKT WLDSMAR I HVKNGDLS EAAMC YVHVTALVAE YLTRK 

HC2C ; 

HC2D PEMLVDLQYSLAK5 YAS TPELRKTWLDSMARI HVKNGDLS EAAMC YVHVTALVM 

HC2E PEMLVDLQYS LAKS YAS T PELRKTWLDSMAR I HVKNGDLS EAAMC YVHVTALVAE YLTRK 

HC2 F . 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



GVFRQGCTAFRVI T PN I DEEASMME DVGMQDVH FNE 

G VFRQGC T AFRV ITPNIDE EASMME DVGMQDVH FNE 

GVFRQGCTAFRVI TPN I DEEASMME DVGMQDVH FNE 

FRQGC T AFRV I T PN I DEEASMME DVGMQDVH FNE 

EAVQWE PPLLPHSHSACLRRSRGGVFRQGC TAFRVI T PN I DEEASMMEDVGMQDVH FNE 
GV FRQGC T AFRV ITPNI DEEASMME DVG 



HC2A 

HC2A-80 

HC2B 
S ^C2C 
^C2D 
^C2E 
5fC2F 



DVLMELLEQCADGLWKAERYEL IAD I YKL 1 1 P I YEKRR 

DVLMELLE QCADGLWKAERYEL IAD I YKL 1 1 P I YEKRR 

DVLMELLE QCADGLWKAERYEL I AD I YKL 1 1 P I YEKRRD FERLAHL YDTLHRAYS K 
DVLMELLE QCADGLWKAERYE L IAD I YKL 1 1 P I YEKRRDFERLAHL YDT LHRAYSK 

DVLMELLEQCADGLWKAERYELIADIYKLI I PI YEKRRD FERLAHL YDTLHRAYS K 
KAERYEL I AD I YKL 1 1 P I YEKRRD FERLAHL YDT LHRAYSK 



fflC2A 
[HC2A-80 
CHC2B 
lC2C 
;HC2D 
5&C2E 
^=^iC2F 



. ; DFFEDEDGKE Y I YKEPKLTPLSE 

' : DFFEDEDGKEYI YKEPKLTPLSE 

VTEVMHSGRRLLGTYFRVAFFGQ GFFEDEDGKEY I YKEPKLTPLSE 

VTEVMHSGRRLLGTYFRVAFFGQ GFFEDEDGKEY I YKEPKLTPLSE 

VTE VMHS GRRLLG T Y FRVAF FG QAAQ YQ FT DSETDVE GFFEDEDGKEY I YKEPKLTPLSE 
VTEVMHSGRRLLGTYFRVAFFGQ GFFEDEDGKEY I YKEPKLTPLSE 



QIC2A 
H&C2 A- 8 0 
"fec2B 

HC2C 

HC2D 

HC2E 

HC2F 



I S QRLLKL YS DKFGS ENVKM I QDS GKVNPKDLDS KYAY I QVTHV I P FFDEKELQERKTE F 
I S QRLLKL YS DKFGS ENVKM I QDS GKVNPKDLDS KYAY I QVTHVI P FFDEKELQERKTE F 

I SQRLLKLYSDKFGSENVKMI QDS GKVNPKDLDSICYAY I QVTHV IP FFDEKELQERKTE F 
I S QRLLKL YS DKFGS ENVKMT QDS GKVNPKDLDS KYAY I QVTHVI P FFDEKELQERKTE F 

I SQRLLKLYS DKFGS ENVKM I QDS GKVNPKDLDS KYAY I QVTHVIPFFDE ICE LQERKTEF 
I S QRLLKLYS DKFGS ENVKM I QDS GKVNPKDLDS KYAY I QVTHVI P FFDEKELQERKTE F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



ERSHNIRRFMFEMPFTQTGKRQGGVEEQCKRRTILTAIHCFPYVKKRIPVMYQHHTDLNP 
ERS HN IRRFM FEMP FT QTGKRQGGVEE QCKRRT I LTAI HC FP YVKKRI PVMYQHHT DLNP 

ERSHNIRRFMFEMPFTQTGKRQGGVEEQCKRRTILTAIHCFPYVKKRIPVMYQHHTDLNP 
ERS HNIRRFM FEMP FTQTGKRQGGVEEQCKRRTILTAIHCFP YVKKRI PFMYQHHTDLNP 
ERSHNIRRFMraMPFTQTGKRQGGVEEQCKRRTI LTAIHCFPYVKKRI PVMYQHHTDLNP 

ERS HNIRRFM FEMP FTQTGKRQGGVEEQCKRRT I LTAI HCFPYVKKRI PVMYQHHTDLNP 



4 

f 1ft. Ctort.) 



HC2A 

HC2A-8 0 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

1?p2D 

gC2E 

lffc2F 



I E VAI DEMSKKVAELRQLCS SAEVDMI KLQLKLQGS VSVQVNAGPLAYARAFLDDTNTKR 
I EVAI DEMSKKVAELRQLCS SAEVDMI KLQLKLQGS VSVQVNAGPLAYARAFLDDTNTKR 
IEVAI DEMSKKVAELRQLCS SAEVDMI KLQLKLQGS VSVQVNAGPLAYARAFLDDTNTKR 

IEVH2 

I E VAI DEMSKKVAELRQLCS S AE VDM I KLQLKLQGS VSVQVNAGPLAYARAFLDDTNTKR 
IEVAIDEMSKKVAELRQLCSSAEVDMIKLQLKLQGSVSVQVNAGPLAYARAFLDDTNTPCR 



YPDNKVKLLKEVFRQFVEACGQALAVNERL I KEDQLEYQEEMKANYREMAKELSE IMHEQ 
YPDNKVKLLKEVFRQFVEACGQALAVNERL I KEDQLE YQEEMKANYREMAKELSE IMHEQ 
YPDNKVKLLKEVFRQFVEACGQALAVNERL I KEDQLE YQEEMKANYREMAKELSE IMHEQ 



YPDNKVKLLKEVFRQFVEACGQALAVNERL I KEDQLE YQEEMKANYREMAKELSE IMHEQ 
YPDNKVKLLKEVFRQFVEACGQAIAVNERLIKEDQLEYQEEMKANYREMAKELSEIMHEQ 



ICPLEEKTSVLPNSLHIFNAISGTPTSTMVHGMTSSSSWZ 

ICPLEEKTSVLPNSLHIFNAISGTPTSTMVHGMTSSSSWZ 

ICPLEEKTSVLPNSLHIFNAISGTPTSTMVHGMTSSSSWZ 



LG 

ICPLEEKTSVLPNSLHIFNAISGTPTSTMVHGMTSSSSWZ 






m 
Id 



hi 
O 

m 
O 



7.5 kb 




a. co 



CN 




HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



HC2A 
KIAA 

^3 

Ms 



Hf 2A 
6jAA 
rat 
JiC4 

IP 1 
lib 3 

hfC5 



g4C2A 
TRlAA 
rat 
HC4 
HC1 
HC3 
HC5 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



AS GNLDKNARFSAI YRQDSNKLSNDDMLKLLADFRKPEKMAKLPVI LGNLD I T I DNVS S D 



FPNYVNS S Y I PTKQFE TC SKT PIT FE VEE FVPC I PKHTQPYT I YTNHLYVYPKYLKYDS Q 



VLHHHQNPE FYDE I K 

KS FAKARNI AI C I E FKDS DEEDS QPLKC I YGRPGGPVFTRSAFAAVIjHHHQNPEFYDE IK 



IELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDWETQVGYSWLPLLKDGRWTSEQHI 
I ELPTQLHEKHHLLLT FFHVS CDNSSKG S TKKRDWETQVGYSWLPLLKDGRWTSEQH I 



PVSANLPSGYLG YQELGMGRHYGPE I KWVDGGKPLLKI S THLVS TVYTQDQHLHNFFQYC 
PVSANLPSGYLGYQELGMGRHYGPEIKVirVT)GGKPLLKISTHLVSTVYTQDQHLHNFFQYC 



GPGPARS TVS I S L I SNSARV 



QKTE S GAQALGNE LVKY LKS LHAME GHVM I AFLP T I LNQL FRVL T - RAT QEE VAVNVT RV 
QKTESGAQALGNELVKYLKSLHAMEGHVMIAFLPTILNQLFRVLT-RATQEEVAVNVTRV 



. MEIQVLIRFLSVILMQLFWVLPNMIHEDDVPISCPMV 

MS FLP 1 1 LNQLFKVL V- QNEEDE ITT TVTRV 

NRSRSLSNSNPDISGTPTSPDDEVRSIIGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAES 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 





HC2A I IHWAQCHE^^»SHLRS YVKYAYKAEPYVASEYKTVHEEL'^MrTILKPSADFLTSN 

KIAA I IHWAQCHE^KsHLRS YVKYAYKTVEPYVASEYKTVHEElSWtTILKPSADFLTSN 

rat 

HC4 LFHIVSKCHEEGLDSYLSSFIKYSFRPGKPSAPQAPLIHETIATMMIAIjLKQSADFLAIN 

HC1 LPDIVAKCHEEQLDHSVQSYIKFVFKTR ACKERPVHEDLAKNVTGLLK-SNDSPTVK 

HC3 TQAMDRSCNRMSSHTETSSFLQTLTGRLP TKKLFHEELALQWVVCSG — SVR E 

HC5 



Cadherin 



KLLR YS WFFFDVL IKSMAQHL IENSKVKLL 
KLLKYSW FFFDVL I KSMAQHL I ENS KVKLL 


*NQR 
■<NQF 


FPAS YHHAAE TWNMLMPH I TQKFGD 
FPAS YHHAVE T WNMLMPH I T QK FRD 


KLLKYSWFFFEI IAKSMATYLLEENKIKLT 
HVLKHSWFFFAI I LKSMAQHL I DTNKIQLP 
SALQQAWFFFELMVKSMVHHLYFNDKLEAP 


iGQR 
*PQP 
*KSR 


FPKAYHHALH S L FLA I T-IVES QYAE 
FPE S YQNE LDNL VMVL S DHV I WKYKD 
FPERFMDDIAALVSTIASDIVSRFQK 



HC2A NPEASKNANHSLAVFIKRCFTFMDRGFVFKQIN NY IS — CFAPGDPKTLFEYKFEFL 

KIAA NPEASKNANHSLAVFIKRCFTFMDRGFVFKQIN NYIS — CFAPGDPKTLFEYKFEFL 

rat 

HC4 IPKESRNVNYSLASFLKCCLTLMDRGFVFNLIN DYIS — GFS PKDPKVLAE YKFE FL 

HC 1 ALEE T RRATHS VARFLKRC FT FMDRGC VFKMVN NYIS — MFSS GDLKT LCQYK FD FL 

HC3 DTEMVERLNTSLAFFLNDLLSVMDRGFVFSLIKSCYKQVSSKLYSLPNPSVLVSLRLDFL 



H^2A RWCNHEHY IPLNLPM PFGKGRIQR YQDLQL DYSLTDEF 

KJAA RWCNHEHYIPLNLPM PFGKGRIQR YQDLQL DYSLTDEF 

r^sit • 

IfcM QT I CNHEHY IPLNLPM AFAKPKLQR VQDSNL EYSLSDEY 

QEVCQHEHFIPLCLPIRSANIPDPLTPSES TQELHASDMPE Y S VTNE F 

H"^3 RIICSHEHYVTLNLPCSLLTPPASPSPSVSSATSQSSGFSTNVQDQKIANMFELS — VPF 

HC5 MNADTAPTSPCPSIS SQNSSSCSSFQDQKIASMFDRTSRVPA 

ftC 2 A CRNH FLVGLLLREVG TALQE FRE VRL I AI SVLKNLL I KHS FDDRYAS RS HQAR I AT 

f< J AA CRNH FLVGLLLREVG TALQE FRE VRL I AI SVLKNLL I KHS FDDRYAS RS HQAR I AT 

fat ■ 

fit 4 CKHH FLVGLLLRE T S I ALQDNYE 1 RYTAI S VI KNLL IKHAFDTR YQHKNQQAK I AQ 

tet 1 CRKHFL I G I LLREVG FALQEDQD VRHLALAVLKNLMAKHS FDDRYRE PRKQAQ IAS 

HC3 RQQHYLAGLVLTELAVILDPDAEGLFGLHKKVINMVHNLLSSHDSDPRYSDPQIKARVAM 

HC5 S STS - S PGLLFTELAAALDAEGEG I SEVQRKAVSAIHSLLS SHDLDPRCVKPEVKVKI AA 

HC2A LYLPLFGLLIENVQRINVRDVSPFPVNAG-MTVKDESLALPAVNPLVTPQKGSTLDNSLH 

KIAA LYLPLFGLLIENVQRINVRDVSPFPVNAG-MTVKDESLALPAVNPLVTPQKGSTLDNSLH 

rat 

HC4 LYLPFVGLLLENIQRLAGRDTLYSCAAMPNSASRDEFPCG FTSP — AN — RGSLS 

HC1 LYMPLYGMLLDNMPRI YLKDLYPFTVNTSNQGSRDDLS TNGGFQSQTAIKHANSVDTSFS 

HC3 LYLPLIGIIMETVPQLYDFTETHNQRGRPICIATDDYESE SG SMIS 

HC5 LYLPLVGI ILDALPQLCDFTVADTRRYR TSGSDEEQE GA GAIT 

HC2A KDLLGAISGIASPYTTSTPNINSVRNADSRGSLISTDSGNSLPERNSEKSNSLDKHQQSS 

KIAA KDLLGAI SG I AS PYTTSTPNINS VRNADSRGSL I S TDSGNSL PERNS EKSNSLDKHQQS S 

rat 

HC4 TDKDTAYGS FQNG HGIKREDSRGSLIP-EGATGFPDQGNTGEN TRQS 

HC1 KDVLNS I AAFS S IAISTVNHADSRASLASLDSNPSTNEKSSEKTDNCEKIPRPL 

HC3 QTVAMAIAGTSVPQ LTRPGSFLLTSTSGRQHT — 

HC5 QNVALA I AGNN FN LKTSG-IVLSSLPYKQYN 



HC2A TLGNSVVRCDKLDQSEIKSLLMCFLYILKSMSDDALFTYWN-KASTSELMDFFTISEVCL 

KIAA TLGNSWRCDKLDQSEIKSLliMCFLYILKSMSDDALFTYWN-KASTSELMDFFTISEVCL 

rat 

HC4 STRSSVSQYNRLDQYEIRSLLMCYLYIVKMISEDTLLTYWN-KVSPQELINILILLEVCL 

HC1 ALIGSTLRFDRLDQAETRSLLMCFLHIMKTISYETLIAYWQ-RAPSPEVSDFFSILDVCL 

HC3 TFSAESSRSLLICLLWVLKN-ADETVLQKWFTDLSVLQLNRLLDLLYLCV 

HC5 MLNADTTRNLMI C FLW IMKN-ADQSLIRKW I ADLPSTQLNRI LDLLFI CV 

HC2A HQFQYMGKRY I ARNQEGLG — PIVHDRKS QTLPVSRNRTGMM 

KIAA HQFQYMGKRY I AR TGMM 

rat — \ 

HC 4 FHFRYMGKRN I ARVHDAWL S KH FG I DRKS QTMPALRNRSGVM 

HC1 QNFRYLGKRNI IRKIAAAF — KFVQS T QNNG T LKG SNP S CQT S GLLAQWMH S T SRHEGHK 

HC 3 S C FE YKGKKVFERMNS LTFK — KS KDMRAK LEEAI LGSI GARQEMV 

HC5 LCFEYKGKQSS DKVS T QVLQ — KSRDVKAR LEEALLRGE GARGEMM 



HC2A HARLQQL GSLDNS LTFNHSYGHSDADVLHQSLLEANIATEVC 

KIAA HARLQQL GSLDNS LT FNHS YGHS DADVLHQS LLEAN I ATEVC 

rat : 

HG4 QARLQHL SSLESS FTLNHSSTTTEADI FHQALLEGNTATEVS 

Jtlti QHRSQTLPI IRGK NALSNPKL LQMLDNTMTSNSNEIDIVHHVDTEANIATEGC 

rfSft RRSRGQLERSPSGSAFGSQENLRWRKDMTHWRQNTEKLDKSRAEIEHEALIDGNLATEAN 

H& RRRAPGNDRFP GLNENLRWKKEQTHWRQANEKLDKTKAELDQEALISGNLATEAH 



HS 2 A LTALDTLSLFTIAFKNQLLADHGHNPLMKKVFD^ 

KIAA LTALDTLSLFTLAFKNQLLADHGHNPLMKKVFDVYLCFLQKHQSETALI^ 

E a t KLSRGHS PLMKKVFDVYLC FLQKHQSEMALKNVFTALRS LIY 

HC4 LTVLDTISFFTQCFKTHFLNNDGHNPLMKKVFDIHIAFLKNGQSEVSLKHVFASLRAFIS 

HC1 LTILDLVSLFTQTHQRQLQQCDCQNSLMKRGFDTYMLFFQVNQSATALKHVFASLRLFVC 

gfc 3 L 1 1 LDTLE IWQTVS — VTE S — KESI LGGVLKVLLHSMACNQS AVYLQHC FATQRALVS 

MC 5 L 1 1 LDMQEN 1 1 QAS S — ALDC — KDS LLGGVLRVLVNS LNCDQS TT YLTHC FATLRAL I A 



PC2A KFPSTFYEGRADMCAALCYEILKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKS FVRTH 

gilAA KFPS T FYEGRADMCAALC YE I LKCCNSKL SSI RTEAS QLL YFLMRNNFD YTGKKS FVRTH 

Tat KFPSTFYEGRADMCASLCYEVLKCCNSKLSSIRTEASQLLYFI^IRNNFDYTGKKSFVRTH 

HC4 KFPSAFFKGRVl^CAAFCYEVLKCCTSKISSTRNEASALLYLLMRNNFEYTKRKTFLRTH 

HC1 KFPSAFFQGPADLCGS FC YE VLKCCNHRSRS TQTEASALLYLFMRKNFEFNKQKS I VRSH 

HC3 KFPELLFEEETEQCADLCLRLLRHCSS S IGT I RSHPSASLYLLMRQNFE I GN — NFARVK 

HC5 KFGDLLFEEEVEQCFDLCHQVLHHCSSSMDVTRSQACATLYLLMRFS FGATS — NFARVK 



HC2A LQV 1 1 S VS QL I ADWG I GE TRFQQS L S 1 1 NNCANS DRL I KHTS FS S DVKDLTKRI RTVLM 

KIAA LQVI I SVSQLIADWGIGGTRFQQSLS I INNCANS DRL I KHTS FSS DVKDLTKRI RTVLM 

rat hQVIIShSQhIAD^r^GIGGTRFQQSLSlImC^SDRLIKHTSFSSDVKDhTKRIRT^^JM 

HC4 LQI I IAVSQLIADVALSGGSRFQESLFI INNFANSDRPMLARAFPAEVKDLTKRIRTVLM 

HC 1 LQL I KAVS QL IAD-AG I GGSRFQHS LAI TNNFANGDKQMKNSNFPAEVKDLTKR I RTVLM 

HC3 MQVPMSLS S LVGTS QNFNEE FLRRS LKT I LT YAEEDLELRE T TFPDQVQDLVFNLHM I LS 

HC5 MQVTMSLASLVGRAPDFNEEHLRRSLRTILAYSEEDTAMQMTPFPTQVEELLCNLNSILY 





ansmenibrane 



ATAQMKEHENDPEMLVDLQYSIoAKSYASTPELRKTWLDSMARIHVKNGELSEAAMCYVHV 
ATAQJ4KEHENDPEMLVDLQYSIiAKSYASTPELRKTWLDSMARIHVKNGDLSEAAMCYVHV 
ATAQMKEHENDPEMLVDLQYSLAKSYASTPELRKTWLDSMARIHVKNGE LSEAAMCYVHV 
ATAQMKEHEKDPEMLIDLQYSLAKSYASTPELRKTWLDSMAKIHVKNGDFSEAAMCYVHV 
ATAQMKEHEKDPEMLVDLQYS LANS YAS T PELRRTWLE SMAKI HARNGD LSEAAMC Y I H I 
DTVKMKEHQEDPEMLID3LMYRIAKGYQTSPDLRLTWLQNMAGKHSERSNHAEAAQCLVHS 
DTVKMREFQEDPEMLMDLMYRIAKSYQASPDLRLTWLQNMAEKHTKKKC YT 



SH3 



TALVAEYI TRKGV 

TALVAEYI TRKEA 

TALVAEYI TRKEAD 

AALVAEFI HRKKL 



- - frqgctafrvi t pn 
— vqweppllphshsaclrrsrlggvfrqgctafrvitpn 
•lalqreppvfpyshtscqrksrIggmfrqgctafrvitpn 

•fpngcsafkki tpn 

aaliaeylikrkgywkvekicftasllsedthpcdsnsllttpsglgsmfsmgwpaflsitpn 
aalvaeyi smled- 
aalvae yi smled- 



RKYLPVGCVT FQN ISSN 
HS YLPVGSVS FQN I S SN 



I DE EASMME DVGMQD VHFNEDVLMELLEQCADGLWKAERYEL IAD I YKL 1 1 P I 

I DEEASMMEDVGMQD VH FNE DVLME L LE QCADGLWKAE RYE L IAD I YKL 1 1 P I 

I DEEASMMEDVGMQD VH FNEDVLMELLEQCADGLWKAERLRAGLLT SINSSSP 

I DEEGAMKEDAGMMD VH YSEEVLLELLEQCVNGLWKAERYE IISEISKLIGPI 

I KEE GAAKEDS GMHD TPYNENILVEQLYMCGEFLWKSERYELIADVNKPIIAV 

VLEESAVSDDVVSPDEEGICSGKYFTESGLVGLLEQ7VAA3FSMAGMYEAVNEVYKVLIPI 
VLEESWSEDTLSPDEDGVCAGQYFTESGLVGLLEQAAELFSTGGLYETVNEVYKLVIPI 



IT AM 



IT AM 



IT AM 



IT AM 



YEKRRD- 



YEKRROFERLAHI YDTI HRA YSKV TEVMHSGRRLLGl fcFRtf &FFGQAAQYQFTDSETDVE 



SMKS GGTLET THI YDTI HRP YSKV TE VI 



YENRREFENLTQV YRTI HGA YTKI LEVMHTKKRLLG 
FEKQRDFKKLSDI YYDIHRS f LKV AEWNSEKRLFG 
HEANRDAKKLST 3 HGKI QEA ESKI VHQSTGWERMFG 
LEAHREFRKLTLT HSKI QRA EDS I VNKDH — KRMFG 



TR A- AGS WDLL PGGL FGQ 

TFFRVAFYGQ 



■PYYRVAFYGQ 
TYFRVGFYG- 
•iYFRVGFFG- 



ITAM IT AM 
-FFEDEDGKElYIYKEtfKLTPLSEISQRLLKl^SDNFGSEm 

GFFEDEDGKE YIYKEF KLTFLSEISQRLLKL YSDK FGSENVKMIQDSGKVNPKDLDSK YA 
GFFEDEDGKE YI YKEFKLTPLSEISQRLLKI YSDK FGSENVKMIQDSGKVNPKDLDSK FA 
S FFEEEDGKE YIYKEP KLTGLSE I SLRLVKI YGEK FGTENVKI I QDSDKVNAKELDPK YA 
G FFEEEEGKE Y I YKE P KLTGLS E I SQRLLKI YADK FGADNVKI IQDSNKVNPKDLDPK YA 
TKFGDLDEQE FVYKEP AITKLAEI SHRLEGH YGEP FGEDWEVIKDSNPVDKCKLDPNKA 
SKFGDLDEQE FVYKEE AITKLPEI SHRLEAF YGQC FGAE FVE V I KDS T P VDKTKLD PN KA 



IT AM 



YIQV THVI P FFDEKELQERKTE FERSHNI RRFM FEMP FTQTGKRQGG VEEQCKRRT I LTA 
YIQVTHVIPFFDEKELQERKTEFERSIiNIRRFMFEMPFTQTGKRQGGVEEOCKRRTILTA 
YIQVrHVTPFFDEKELQERKTEFERCHNIRRFMFEMPFTQTGKRQGGVEEQCKRRTILTA 
HIQVTYVKPYFDDKELTERKTEFERNHNISRFVFEAPYTLSGKKQGCIEEQCKRRTILTT 
YIQVrYVTPFFEEKEIEDRKTDFEMHHNINRFVFETPFTLSGKKHGGVAEQCKRRTILTT 
Y I Q I TYVEP YFDT YEMKDRI TYFDKNYNLRRBKYCT PFTLDGRAHGELHEQFKRKT I LTT 
Y I Q 1 1 FVE P Y FDE YEMKDRVT Y FEKN FNLRRFMY T T P FT LEGRPRGELHEQYRRNT VL T T 




Coiled-Coil 




IHCFPYVKKRr^MYQHHTDUNPIEVAIDEMSKKVAELRQLCS^fffiVDMIKLqLKLQGSV 
I HC FP YVKKR I PVMYQHH TDI >NP I EVAI DEMSKKVAE LRQLC S S AEVDM I KLQ LKLQGSV 
I HC FP YVKKR I PVMYQHHTDI .NPI EVAI DEMS KKVAE LHQLC S S AEVDM I KLQ LKLQG S V 
SNS FP YVKKR I P I NCEQQINI ,KP I DGATDE IKDKTAELQKLC S S TDVDMI QLQ LKLQG WV 
SHLFPYVKKRIQVISQSSTEI.NPIEVAIDEMSRKVSELKQLCTMEEVDMISLO LKLQG SV 
SHAFPYIKTRVNVTHKEEI ii.TPIEVAIEDMQKKTQELAFATHQDPADPKMLQ tfVLQGSV 
MHAFP Y I KTR I S V I QKEE FV1 1 T P I EVAIEDMKKKTLQLAVAI NQE PPDAKMLQ tfVLQGSV 



Coled-Coil 2 

S VQVNAG P LAYARAFL DDTNTKR Y P DNKVKLLKE VFRQ FVEACGQAllAVNERL I KE DQLE 
SVQVHAGPIAYARAFLDDTNTKRYPDNK\TCLLKEVF^ 

SVQVKAGPLAYARAFLDDTNTKRYPDNKVKLLKEVFRQFVEACGQAI AVNERLI KEDQLE 
SVQVNAGPLAYARAFLNDSQASKYPPKKVSELKDMFRKFIQACSIAI ELNERLIKEDQVE 
S VKVNAGPMAYARAFLEE TNAKKYPDNQVKLLKE I FRQFADACGQAI DVNERLIKEDQLE 
GTTVNQGPLE VAQVFLSE I PS DPKL FRHHNKLRLC FKDFTKRCEDAI RKNKS L I GPVQKE 
GATVNQGPLEVAQVFLAE I PADPKL YRHHNKLRLC FKE FIMRCGEA\/ EKNKRL I TADQRE 



Coiled-Coil 2 



YQEEMKANYREMAKELSEIMHEQICPLEEKTS-VLPNSLHIFNAISGTPTSTMVHGMTSS 

YQEEMKANYREMAKELSEIMHE QLG 

YQEEMKANYRE IRKELSDI I VE RI CPGEDKRATKFPAHLQRHQRDTNKHSGSRVDQFI LS 
YfiEGLKSNFRDMVKELSDIIHEQILQEDTMHSPWMSNTLHVFCAISGTSSDRGYGSPf^^ 

YQEELRSHYKDMLSELSTVMNEQITGRDDLSK RGVDQTCTRVI SKAT PALP TVS I S S 

YQRELG KLSS PZ 

YQQELKKNYNKLKENLRPMIEF KI PEL YKP I FRVE S QKRDS FHRS S FRKCETQL S QGS Z- 



PBM 

Isswte - 



CVTLPHEPHVGTCFVMCKLRTTFRANHWFCQAQEEAMGNGREKEPWTVIFNSRFYRSWGK 

llyfc-- 

[SAE\| Z 



VHIFF 



B 



CLASP - 1 
KIAA1058 
CLASP -2 
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1 

A 



2 

GTT TTA CAC CAT CAC CAA AAC CCA GAA TTT 
val leu his his his gin asn pro glu phe 

bE 

CAG CTG CAT GAA AAG CAC CAC CTG TTG CTC 
gin leu his glu lys his his leu leu leu 

1EE 

AGT AAA GGA AGC ACG AAG AAG AGG GAT GTC 
ser lys gly ser thr lys lys arg asp val 

1AE 

CCC CTC CTG AAA GAC GGA AGG GTG GTG ACA 
Dpro leu leu lys asp gly arg val val thr 

Q1EME 

QKTT CCT TCG GGC TAT CTT GGC TAC CAA GAG 
SJleu pro ser gly tyr leu gly tyr gin glu 

§7i 30E 

Cl ATT AAA TGG GTA GAT GGA GGC AAG CCA CTG 

J* ile lys trp val asp gly gly lys pro leu 
f, a ref 1 • It 1 • E and 1*3 

El 3bE 1 

H GTG TAT ACT C AG GAT CAG CAT TTA CAT AAT 
ft val tyr thr gin asp gin his leu his asn 

O qae 

O GGA GCC CAA GCC TTA GGA AAC GAA CTT GTA 
gly ala gin ala leu gly asn glu leu val 

MAE 

GGC CAC GTG ATG ATC GCC TTC TTG CCC ACT 
gly his val met ile ala phe leu pro thr 

SME 

AGA GCC ACA CAG GAA GAA GTC GCG GTT AAC 
arg ala thr gin glu glu val ala val asn 

bOE 

CAG TGC CAT GAG GAA GGA TTG GAG AGC CAC 
gin cys his glu glu gly leu glu ser his 



bbE 

GCT GAG CCA TAT GTT GCC TCT GAA TAC AAG 
ala glu pro tyr val ala ser glu tyr lys 



3E 

TAT GAT GAG ATT AAA ATA GAG TTG CCC ACT 
tyr asp glu ile lys ile glu leu pro thr 

IE 

ACA TTC TTC CAT GTC AGC TGT GAC AAC TCA 
thr phe phe his val ser cys asp asn ser 

1SE 

GTT GAA ACC CAA GTT GGC TAC TCC TGG CTT 
val glu thr gin val gly tyr ser trp leu 

EXE 

AGC GAG CAG CAC ATC CCG GTC TCG GCG AAC 
ser glu gin his ile pro val ser ala asn 

E7E 

CTT GGG ATG GGC AGG CAT TAT GGT CCG GAA 
leu gly met gly arg his tyr gly pro glu 

33E 

CTG AAA ATT TCC ACT CAT CTG GTT TCT ACA 
leu lys ile ser thr his leu val ser thr 

31E 

TTT TTC CAG TAC TGT CAG AAA ACC GAA TCT 
phe phe gin tyr cys gin lys thr glu ser 

MSE 

AAG TAC CTT AAG AGT CTG CAT GCG ATG GAA 
lys tyr leu lys ser leu his ala met glu 

SIE 

ATC CTA AAC CAG CTG TTC CGA GTC CTC ACC 
ile leu asn gin leu phe arg val leu thr 

S7E 

GTG ACT CGG GTC ATT ATT CAT GTG GTT GCC 
val thr arg val ile ile his val val ala 

b3E 

TTG AGG TCA TAT GTT AAG TAC GCG TAT AAG 
leu arg ser tyr val lys tyr ala tyr lys 



b^E 

ACA GTG CAT GAA GAA CTG ACC AAA TCC ATG 
thr val his glu glu leu thr lys ser met 



722 752 

ACC ACG ATT CTC AAG CCT TCT GCC GAT TTC CTC ACC AGC AAC AAA CTA CTG AGG TAC TCA 

thr thr ile leu lys pro ser ala asp phe leu thr ser asn lys leu leu arg tyr ser 

762 file 

TGG TTT TTC TTT GAT GTA CTG ATC AAA TCT ATG GCT CAG CAT TTG ATA GAG AAC TCC AAA 

trp phe phe phe asp val leu ile lys ser met ala gin his leu ile glu asn ser lys 

6MB ICadherin Cleavage! 672 

GTT AAG TTG CTG CGA AAC CAG AGA TTT CCT GCA TCC TAT CAT CAT GCA GCG GAA ACC GTT 

val lys leu leu arg asn gin arg phe pro ala ser tyr his his ala ala glu thr val 

IDE ^32 

GTA AAT ATG CTG ATG CCA CAC ATC ACT CAG AAG TTT GGA GAT AAT CCA GAG GCA TCT AAG 

val asn met leu met pro his ile thr gin lys phe gly asp asn pro glu ala ser lys 

1L2 <H2 

AAC GCG AAT CAT AGC CTT GCT GTC TTC ATC AAG AGA TGT TTC ACC TTC ATG GAC AGG GGC 

Gbsn ala asn his ser leu ala val phe ile lys arg cys phe thr phe met asp arg gly 
J3 ref 2»1 

g! f D22 1052 

CyTTT GTC TTC AAG CAG ATC AAC AAC TAC ATT AGC TGT TTT GCT CCT GGA GAC CCA AAG ACC 

phe val phe lys gin ile asn asn tyr ile ser cys phe ala pro gly asp pro lys thr 



yflDa2 1112 

CjCTC TTT GAA TAC AAG TTT GAA TTT CTC CGT GTA GTG TGC AAC CAT GAA CAT TAT ATT CCG 

_ leu phe glu tyr lys phe glu phe leu arg val val cys asn his glu his tyr ile pro 

,™11M2 1172 

rfTTG AAC TTA CCA ATG CCA TTT GGA AAA GGC AGG ATT CAA AGA TAC CAA GAC CTC CAG CTT 

leu asn leu pro met pro phe gly lys gly arg ile gin arg tyr gin asp leu gin leu 

yi2D2 1232 

UGAC TAC TCA TTA ACA GAT GAG TTC TGC AGA AAC CAC TTC TTG GTG GGA CTG TTA CTG AGG 

asp tyr ser leu thr asp glu phe cys arg asn his phe leu val gly leu leu leu arg 

12t>2 1212 

GAG GTG GGG ACA GCC CTC CAG GAG TTC CGG GAG GTC CGT CTG ATC GCC ATC AGT GTG CTC 

glu val gly thr ala leu gin glu phe arg glu val arg leu ile ala ile ser val leu 

■ ref 3 . 1 

1322 13S2 i 

AAG AAC CTG CTG ATA AAG CAT TCT TTT GAT GAC AGA TAT GCT TCA AGG AGC CAT CAG GCA 

lys asn leu leu ile lys his ser phe asp asp arg tyr ala ser arg ser his gin ala 



1362 1412/471 

AGG ATA GCC ACC CTC TAC CTG CCT CTG TTT GGT CTG CTG ATT GAA AAC GTC CAG CGG ATC 

arg ile ala thr leu tyr leu pro leu phe gly leu leu ile glu asn val gin arg ile 

1442 1472 

AAT GTG AGG GAT GTG TCA CCC TTC CCT GTG AAC GCG GGC ATG ACC GTG AAG GAT GAA TCC 

asn val arg asp val ser pro phe pro val asn ala gly met thr val lys asp glu ser 

1502 1532 



CTG GCT CTA CCA GCT GTG AAT CCG CTG GTG ACG CCG CAG AAG GGA AGC ACC CTG G AC A AC 
leu ala leu pro ala val asn pro leu val thr pro gin lys gly ser thr leu asp asn 

■ ref M-1 and M . E 
15b5 4 

AGC CTG CAC AAG GAC CTG CTG GGC GCC ATC TCC GGC ATT GCT TCT CCA TAT ACA ACC TCA 
ser leu his lys asp leu leu gly ala ile ser gly ile ala ser pro tyr thr thr ser 



lbE2 lfc.SE 

ACT CCA AAC ATC AAC AGT GTG AGA AAT GCT GAT TCG AGA GGA TCT CTC ATA AGC ACA GAT 

thr pro asn ile asn ser val arg asn ala asp ser arg gly ser leu ile ser thr asp 

ref 5.1 and 5*E 

lfc.SE 171E 

TCG GGT AAC AGC CTT CCA GAA AGG AAT AGT GAG AAG AGC AAT TCC CTG GAT AAG^CAC CAA 

ser gly asn ser leu pro glu arg asn ser glu lys ser asn ser leu asp lys his gin 



17ME 177E 

CAA AGT AGC ACA TTG GGA AAT TCC GTG GTT CGC TGT GAT AAA CTT GAC CAG TCT GAG ATT 

gin ser ser thr leu gly asn ser val val arg cys asp lys leu asp gin ser glu ile 

plflDS 1A3E 

AGC CTA CTG ATG TGT TTC CTC TAC ATC TTA AAG AGC ATG TCT GAT GAT GCT TTG TTT 

Jlys ser leu leu met cys phe leu tyr ile leu lys ser met ser asp asp ala leu phe 

pifibS IBIS 

Jj^CA TAT TGG AAC AAG GCT TCA ACA TCT GAA CTT ATG GAT TTT TTT ACA ATA TCT GAA GTC 

~Mthr tyr trp asn lys ala ser thr ser glu leu met asp phe phe thr ile ser glu val 

| ref b-1 

Ml^ee J nsa 

s TGC CTG CAC CAG TTC CAG TAC ATG GGG AAG CGA TAC ATA GCC AGG AAC CAG GAG GGG TTG 

M=cys leu his gin phe gin tyr met gly lys arg tyr ile ala arg asn gin glu gly leu 



O^flE SD1E 

•GGA CCC ATA GTT CAT GAT CGA AAG TCT CAG ACA TTG CCT GTT TCC CGT AAC AGA ACA GGA 

gly pro ile val his asp arg lys ser gin thr leu pro val ser arg asn arg thr gly 

EDMS ED7E 

ATG ATG CAT GCC AGA TTG CAG CAG CTG GGC AGC CTG GAT AAC TCT CTC ACT TTT AAC CAC 

met met his ala arg leu gin gin leu gly ser leu asp asn ser leu thr phe asn his 

E10E E13E 

AGC TAT GGC CAC TCG GAC GCA GAT GTT CTG CAC CAG TCA TTA CTT GAA GCC AAC ATT GCT 

ser tyr gly his ser asp ala asp val leu his gin ser leu leu glu ala asn ile ala 



ref 7-1 

ElbE SITE 

ACT GAG GTT TGC CTG ACA GCT CTG GAC ACG CTT TCT CTA TTT ACA TTG GCG TTT AAG AAC 
thr glu val cys leu thr ala leu asp thr leu ser leu phe thr leu ala phe lys asn 



EESE ESSE 

CAG CTC CTG GCC GAC CAT GGA CAT AAT CCT CTC ATG AAA AAA GTT TTT GAT GTC TAC CTG 

gin leu leu ala asp his gly his asn pro leu met lys lys val phe asp val tyr leu 

EEflE E312 



TGT TTT CTT C A A AAA CAT CAG TCT GAA ACG GCT TTA AAA AAT GTC TTC ACT GCC TTA AGG 
cys phe leu gin lys his gin ser glu thr ala leu lys asn val phe thr ala leu arg 



23M2 H372 

TCC TTA ATT TAT A AG TTT CCC TCA ACA TTC TAT GAA GGG AGA GCG GAC ATG TGT GCG GCT 
ser leu ile tyr lys phe pro ser thr phe tyr glu gly arg ala asp met cys ala ala 

2M02 2432 

CTG TGT TAC GAG ATT CTC AAG TGC TGT AAC TCC AAG CTG AGC TCC ATC AGG ACG GAG GCC 
leu cys tyr glu ile leu lys cys cys asn ser lys leu ser ser ile arg thr glu ala 

2Mb2 2M12 

TCC CAG CTG CTC TAC TTC CTG ATG AGG AAC AAC TTT GAT TAC ACT GGA AAG AAG TCC TTT 
ser gin leu leu tyr phe leu met arg asn asn phe asp tyr thr gly lys lys ser phe 

2522 2552 

GTC CGG ACA CAT TTG CAA GTC ATC ATA TCT GTC AGC CAG CTG ATA GCA GAC GTT GTT GGC 
val arg thr his leu gin val ile ile ser val ser gin leu ile ala asp val val gly 

Ug5fl2 2bl2 

yfipTT GGG GAA ACC AGA TTC CAG CAG TCC CTG TCC ATC ATC AAC AAC TGT GCC AAC AGT GAC 
Ale gly glu thr arg phe gin gin ser leu ser ile ile asn asn cys ala asn ser asp 

r4GG CTT ATT AAG CAC ACC AGC TTC TCC TCT GAT GTG AAG GAC TTA ACC AAA AGG ATA CGC 
fjarg leu ile lys his thr ser phe ser ser asp val lys asp leu thr lys arg ile arg 

^702 2732 

; ACG GTG CTA ATG GCC ACC GCC CAG ATG AAG GAG CAT GAG AAC GAC CCA GAG ATG CTG GTG 
Jbhr val leu met ala thr ala gin met lys glu his glu asn asp pro glu met leu val 

N27fc,2 2712 

ySAC CTC CAG TAC AGC CTG GCC AAA TCC TAT GCC AGC ACG CCC GAG CTC AGG AAG ACG TGG 
Oasp leu gin tyr ser leu ala lys ser tyr ala ser thr pro glu leu arg lys thr trp 

2B22 2652 I xxxxxxxxxxxxxxx Predicted 

CTC GAC AGC ATG GCC AGG ATC CAT GTC AAA AAT GGC GAT CTC TCA GAG GCA GCA ATG TGC 
leu asp ser met ala arg ile his val lys asn gly asp leu ser glu ala ala met cys 

Transmembrane Domain xxxxxxxxxxxxxxxxxxxxxxxxxx I 

TAT GTC CAC GTA ACA GCC CTA GTG GCA GAA TAT CTC ACA CGG AAA GGC GTG TTT AGA CAA 
tyr val his val thr ala leu val ala glu tyr leu thr arg lys gly val phe arg gin 

21M2 2*572 

GGA TGC ACC GCC TTC AGG GTC ATT ACC CCA AAC ATC GAC GAG GAG GCC TCC ATG ATG GAA 

gly cys thr ala phe arg val ile thr pro asn ile asp glu glu ala ser met met glu 

ref 6-i| 

3D02 J 3032 

GAC GTG GGG ATG CAG GAT GTC CAT TTC AAC GAG GAT GTG CTG ATG GAG CTC CTT GAG CAG 

asp val gly met gin asp val his phe asn glu asp val leu met glu leu leu glu gin 

30b2 3012 

TGC GCA GAT GGA CTC TGG AAA GCC GAG CGC TAC GAG CTC ATC GCC GAC ATC TAC AAA CTT 
cys ala asp gly leu trp lys ala glu arg tyr glu leu ile ala asp ile tyr lys leu 



_ ref T-l 

3122 I 3152 

ATC ATC CCC ATT TAT GAG AAG CGG^AGG GAT TTC TTT GAA GAT GAA GAT GGA AAG GAG TAT 
ile ile pro ile tyr glu lys arg arg asp phe phe glu asp glu asp gly lys glu tyr 



31fl2 3212 

ATT TAC AAG GAA CCC AAA CTC AC A CCG CTG TCG GAA ATT TCT CAG AGA CTC CTT AAA CTG 

ile tyr lys glu pro lys leu thr pro leu ser glu ile ser gin arg leu leu lys leu 

ref 10-1 

32M2 3272 

TAC TCG GAT AAA TTT GGT TCT GAA AAT GTC AAA ATG ATA CAG GAT TCT GGC AAG'GTC AAC 

tyr ser asp lys phe gly ser glu asn val lys met ile gin asp ser gly lys val asn 



3302 3332 

CCT AAG GAT CTG GAT TCT AAG TAT GCA TAC ATC CAG GTG ACT CAC GTC ATC CCC TTC TTT 
pr lys asp leu asp ser lys tyr ala tyr ile gin val thr his val ile pro phe phe 

33b2 3312 

GAC GAA AAA GAG TTG CAA GAA AGG AAA ACA GAG TTT GAG AGA TCC CAC AAC ATC CGC CGC 
=asp glu lys glu leu gin glu arg lys thr glu phe glu arg ser his asn ile arg arg 

S3M22 3MS2 

JjTTC ATG TTT GAG ATG CCA TTT ACG CAG ACC GGG AAG AGG CAG GGC GGG GTG GAA GAG CAG 

^jane met phe glu met pro phe thr gin thr gly lys arg gin gly gly val glu glu gin 

J ref 11-1 
3S12 

*TGC AAA CGG CGC ACC ATC CTG ACA GCC ATA CAC TGC TTC CCT TAT GTG AAG AAG CGC ATC 

^cys lys arg arg thr ile leu thr ala ile his cys phe pro tyr val lys lys arg ile 

35M2 3572 Ixxxxxxxx Coiled-coil 1 xxxxxx 

CCT GTC ATG TAC CAG CAC CAC ACT GAC CTG AAC CCC ATC GAG GTG GCC ATT GAC GAG ATG 
pro val met tyr gin his his thr asp leu asn pro ile glu val ala ile asp glu met 

3fc»02 xxxxxxxx Coiled coil 1 cont'd xxxx 3b32 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
AGT AAG AAG GTG GCG GAG CTC CGG CAG CTG TGC TCC TCG GCC GAG GTG GAC ATG ATC AAA 
ser lys lys val ala glu leu arg gin leu cys ser ser ala glu val asp met ile lys 

I ref 1B-1 

CTG CAG CTC AAA CTC CAG GGC AGC GTG AGT GTT CAG GTC AAT GCT GGC CCA CTA GCA TAT 
leu gin leu lys leu gin gly ser val ser val gin val asn ala gly pro leu ala tyr 



3722 

GCG CGA GCT TTC TTA GAT GAT ACA AAC ACA 

ala arg ala phe leu asp asp thr asn thr 

3762 

CTT AAG GAA GTT TTC AGG CAA TTT GTG GAA 

leu lys glu val phe arg gin phe val glu 

3fiM2 xxxxxxx Coiled coil 1 2 xxxxxxxxxx 

CGT CTG ATT AAA GAA GAC CAG CTC GAG TAT 

arg leu ile lys glu asp gin leu glu tyr 

3T02 xx xxxxxxxxxx xxxxxx xxxx xxx xxxxxx xx 



3752 

AAG CGA TAT CCT GAC AAT AAA GTG AAG CTG 

lys arg tyr pro asp asn lys val lys leu 

3A12 1 xxxxxxxxxxxxxxxxxx 

GCT TGC GGT CAA GCC TTA GCG GTA AAC GAA 

ala cys gly gin ala leu ala val asn glu 

3fl72 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

CAG GAA GAA ATG AAA GCC AAC TAC AGG GAA 

gin glu glu met lys ala asn tyr arg glu 

3=132 xxxl 




ATG GCG AAG GAG CTT TCT GAA ATC ATG CAT GAG CAG ATC TGC CCC CTG GAG GAG AAG ACG 

met ala lys glu leu ser glu ile met his glu gin ile cys pro leu glu glu lys thr 

3Tb2 3^2 

AGC GTC TTA CCG AAT TCC CTT CAC ATC TTC AAC GCC ATC AGT GGG ACT CCA ACA AGC ACA 

ser val leu pro asn ser leu his ile phe asn ala ile ser gly thr pro thr ser thr 

4022 Ixxxx PBM xxxxxl 

ATG GTT CAC GGG ATG ACC AGC TCG TCT TCG GTC GTG TGA TTA CAT CTC ATG GCC CGT GTG 

met val his gly met thr ser ser ser ser val val STP 

4062 4112 

TGG GGA CTT GCT TTG TCA TTT GCA AAC TCA GGA TGC TTT CCA AAG CCA ATC ACT GGG GAG 

4142 4172 

ACC GAG CAC AGG GAG GAC CAA GGG GAA GGG GAG AGA AAG GAA ATA AAG AAC AAC GTT ATT 

4202 4232 

TCT TAA CAG ACT TTC TAT AGG AGT TGT AAG AAG GTG CAC ATA TTT TTT TAA ATC TCA CTG 

□ 2bE 42 e )2 

JfCA ATA TTC AAA GTT TTC ATT GTG TCT TAA CAA AGG TGT GGT AGA CAC TCT TGA GCT GGA 

0*1*322 43S2 

-JTT AGA TTT TAT TCT TCC TTG CAG AGT AGT GTT AGA ATA GAT GGC CTA CAG AAA AAA AAG 

rV3fl2 4412 

;£TT CTG GGA TCT ACA TGG CAG GGA GGG CTG CAC TGA CAT TGA TGC CTG GGG GAC CTT TTG 



ref 13.1 

3442 J 4472 

<CT CGA CTC GTG CCG GAA ATC TGA TCG TAA TCA GGG TAC AGA ACT TAC TAG TTT TGT CTA 



MSQE 4532 



UGGA GTA TGT TGT ATG ACT AGG ATT TGT GCT ATT ATC TCA TTC AAC AAC ATA GAG CAA GAA 



45b2 45^2 

TAG TGA GCT AAC TGA GCT AGA CAC TCA ATT AAT CCG CTA CTG GCT TCA AGT CAG AAC TTT 

■ ref 14.1 

4b22 4bS2 I 

GTC ATT AAT CAT CGA CTC CGG GAC GGT CAT ATA TGT ATT ACA TTT CTA CAT TTT TAA TAC 



4b&2 4712 

TCA CAT GGG CTT ATG CAT TAA GTT TAA TTG TGA TAA ATT TGT GCT GGT CCA GTA TAT GCA 
4742 4772 

ATA CAC TTT AAT GGT TTA TTC TTG TCA TAA AAA TGT GCA ATA TGG AGA TGT ATA CAA GTC 

4602 
TTT ACT 




BAC sequences of Human CLASP 2 
Refl.l 

Sequence of BAC4 using primer HC2AS2, which spans nucleotides 327-346 of the cDNA. Exon 
sequence is underlined and represents nucleotides 356-375. 

tttctracagngt^actcag gtatgtgctccttcaacaaaattagcagttgctgntctn 

tgacaaagtttgcaccattttgcaagaagaaaaaaatcctaatgtgttatattactata 

tttttactctatagatctttttctaaagaaagaaagtacaactgaagtgcttatatgta 

ttcatataaatgactagtacaagcatcattttgcaacagatttcccctttcattggagg 

atcttcttgatgttatttgtacacgatcaatttttagtcttaataagatgaggctgggtg 

tggtggctcacacctgtaatcctagcattttggaggccaaggtgggcagatcactttag 

cccaggggtttgagaccagcctggccaacatggcaaaaccttgtctctacaaaaatac 

naaaattatccaggcatggtgatgtgtgcctgtagtcccaactncctaggaggctagg 

ggtagggggatttgcaagaggctgggagggtcaaagcccnaantgagccattggtnc 

atgtcacitggaccccaagcnnggggngancaagagcaaaggact^tgtnntttan 

aaaaaaaaccgggctaccatacnnaccaacccncnnacctacccnacctttccanntt 

l 4aaanaaggctttgncttgcanaggaaaancaaaatnncc 

-^Sequence of BAC26 using primer HC2AS2, which spans nucleotides 327-346 of the cDNA. Exon 
essquence is underlined and represents nucleotides 351-375. 

f§(ttggtttctacagtgtatactnag gtatgtgctccttnaacaaaattagcagttgctg 
^tctgtgacaaagtttgcaccattttgcaagaagaaaaaaatcctaatgtgttatatta 
=ctatatttttactctatagatctttttctaaagaaagaaagtacaactgaagtgcttat 
^tgtattcatataaatgactagtacaagcatcattttgcaacagatttcccctttcatt 
^gaggatcttcttgatgttatttgtacacgatcaatttttagtcttaataagatgaggc 
Htgggtgtggtggctcacacctgtaatcctagcattttggaggccaaggtgggcagatc 
^ctttagcccaggggtttgagaccagcctggccaacatggcaaaaccttgtctctaca 
y^aaatacaaaaattatccaggcatggtgatgtgtgcctgtagtcccagctacctagga 
^ggctagggtagggggattgcaagaggctnggaggtcaaggcccgcagtgagccatgg 
tcatgtcactgcacccccagccagggccgacaggagcaagactnttgtntcaaaaaaa 
aacagnaaccaacanccaacaacaacaacnacctttcngcaaaanaagcttgctnca 
angaaaccaaaatgncttcttnttttcccccn 

Ref 1.3 

Sequence of BAC26 using primer HC2AS2, which spans nucleotides 327-346 of the cDNA. Exon 
sequence is not found within this sequence. This sequence most likely represent intron sequence 
since this sequence matches the intron sequence found in the previous two BAC sequences. 
AGWWlSrNCC<^CTACNCCACTTTTAACCTTTTGAA 

GCTCCTTCACATATTAGCAGTTGCTGCTCTGTGACATAGTTGCACCATTNTGCAAGAAG 

AAAAAATCCTAAGTGTNATATCACTATATNNNTACrCTATAGATCTTNTCTAAAGAAAG 

AAAGTCAACTGATGTGCITATATGTATNCATATAAATGACTAGTACATGCATCATTTTG 

CAACAGATNTCTCCTCACATTGGAGGATCTTCTNGANGNATTCGACACGATNANTATTA 

GTCTNAATAAGATGANGCTGGTGTGGNGGTACACTGNATCTAGCATNTGGANGCATGT 



t 



GGCAGACACTTANCCNCGGTNGAGACAGCTGTCACTGNCNAACTGTCTCTNTAAANCA 

AANNCTCCGCNGGNGATGGGCTGAGCCAGTCCTAGNNGCTAGNTAGNGATGNNGAGN 

TGTNGCACGNCGAGNGAGCATGNTCTGTACTGACTCATCAGGCGNCNACACGNTCTGT 

TCNAAAACATACCACACACACTCNCACCTNCGCAAAATTGCTCTNNAAANATGCTTNT 

TTCACACNGNTNCAATCNCTATATNNTCTTCTATTGTNCWACGTNTNATTANNATCTTN 

CNCTGCANAACNATNCGNCCACCTTSTNANNACCTTANGCTTNGTTTCACGCTTATAGCTC 

CCCTACACNTNNCAGOSINTTNCNNGTGAAGGGCCNCCCGAATCTACGANCATACTCTC 

TCCGTATATNGCCTCGGTCANCGCCATCTGCTGTNTNCTCNTCNCTNGCNNTTNANCNG 

TNCGCTATCTCTNNNCCGGATCCNCNCCATATNNTNNCTCTACTTANAGCGTAANNTNT 

NCNCNCACTANTCACAACTTNTNCNTNNAACTCTATCTNCTCCTCTCTACCACCTCAGT 

TACTACCTNTTCACNCANTCTCCTTCNCTNTCCACTGATGTCCACATAGCTGCTNTACTC 

GCCANTTTATCATATNCACACNCTCTACGCTNNNTNT 

Ref 2.1 

Sequence of BAC4 using primer HC2S1, which spans nucleotides 1 107-1 126 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1079-1097. 

cttgtattnaaagagggtc tgcaggaagaagtgtgtagtcataaatacctcactggat 

^ttttatacaggattctaaaaaacctattagcaatagtatgctagaaatagtcattagc 

fecttgaccttcttagaactgcacactctattgcactgtacagatttcaggatggctgc 

Sgggattgatttgaaaactaaggacacatttcaataaacaatgtcttcaattgattttt 

sigggctcctcctacttcaatgaaggacttcaggtagcttataattacagacacaggctc 

2^tacaataaaaaaattagtaaggcagagctttaaaaaaaaaaaaggaaaaagataa 

Wtctacgagagaaaggctacatggtgacttctgttaccagtaacaacccccgcactacc 

yttgggtctccaggagcaaaacagctaatgtagttgttgatctgcttgaagacaaagc 

©cctgtccatgaaggtgaaacatctctgtggaggaaaacaagcaauaaaagttatttca 

^gtccaaacatttcggaaaittggattcaaagcaggcatttattgctaataagtttatc 

gactgacataaaaaacatgccttcaacattgccagagcacctactctattntagtcncn 

{Ref 3.1 

Frequence of BAC4 using primer C96AS, which spans nucleotides 1443-1452 of the cDNA. Exon 
frequence is underlined and represents nucleotides 1370-1422. 

" AATCAGCAGACCAAACAGAGGCAGGTAGAGGGTGGCTATCCTTGCCTGATGGCTCT GA 
AAAGAAGACACACATGGTAAGTTTGACCCAGGATTCTGAGAACCGAACTAAGTTGGTG 
CTGACCATCTCCITTATTTGGATCCTTCCTATAAAGACAGATATTTGATTTTAGTCCCAA 
AATAGAGCAAAATCTTAGTGCTGTTACCATGAATTTTCTAACTGATTACTTTCTTTACAC 
CACTTAAAATAAAGGACATTATCAATGCACATTCCTTCCATTGGGGACCACTCACCCTT 
GAAGCATATCTGTCATCAAAAGAATGCTTTATCAGCAGGTTCTTGAGCACACTGATGGC 
GATCAGACGGACCTCCCGGAACTCCTGGAGGGCTGTCCCCACCTCCCTNAGTAACAGT 
CCCACCAAGAAGTGGTTTCTGCAGAACTCATCTGTTAATGAGTAGTCAAGCTGGGAGG 
TCTGAAATGAGGATAGAAACTACTTTGNGTTAGGAAAGATGCAATGCTCTTTTGAATA 
AAACAAACAAACCAAACNAACAAAAAAAAAACTAAGACCCATCCTTNTGNATTTCAA 
GCCCACCCTGGGGTNGGTCAAAGAGATGATCAGNANTTTGGCNTTNAAATGAAGAAAG 
AAATNAATTNTCCAGGGGNTGTTCTNCTTTTTAGCACANGGAGGGATNTTAANTGAAA 
ACGAATTTAAATCCAATTNAGGNG 

Ref 4.1 



Sequence of BAC4 using primer C2AS5, which spans nucleotides 1716-1735 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1602-1703. 

TTCCTTTCTGC, A A GGCTGTTCCC.G A ATCTGTGCTT ATG AG AG ATCCTCTCG A A TC AGC A 

TTTCTCACACTGTTGATGTTTGGAGTTGAGGTTGTATATGGAGAAGCT AAATGGAAATC 

AAGCCAACAATAAAGTTTTATTAAGACAGAACAAAATAAAGATGAGTACTGAACTTTA 

AGGGAAATTGCTTTTATTGCACTTATTTTTTCTGTTAGGAAGTTGGCTCAAGAGTTGCAT 

TCCATTACTTCACCTTTAAAGAACCAGGTCATATACAATGAGATAAAAAGAAACTAGT 

CTGAAACATTCAGATGTAAACATCAATTCACTTGTTAGAAACCACCTTTGATCGCTAAA 

GACTAAATGCATACCTGTTTCAGAATGTGATAGAATGAAGACTTAAAAAAATTAAAAG 

ATAAATCCACCTAGAACTATCAAATCACAAAATTAAACCACACAACAAACTTGTAGCA 

TTCAAACTGGTAATAAACACTGAGGAGCCTACCCAACTCTGAGGGGTGTCATGGGGTA 

TTTTAAATTTTCGAGGAGAACACAGTGATATGTGACCTCAGCCAGAAGCTGCTGTTTNA 

GCAGCAGGTTGGTGCTATGCTCCTTTTTGAAGACATATTTGTGAAGCTGGGTATTTTGG 

GGGGCCTGCTTATGATAAAANGGCAAGGTNTTCAATGNAGGGGN 

Ref4.2 

Sequence of BAC26 using primer C2AS5, which spans nucleotides 1716-1735 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1602-1703. 

grCCTTTCTGGAAGGCTGTTACCCGAATCTGTGCTTAT GAGAGA TCCTCTCGAATCAGC 
^TTTCTCACACTGTTGATGTTTGGAGTTGAGGTTGTATATGGAGAAGCT AAATGGAAAT 

SAAGCCAACAATAAAGTTTTATTAAGACAGAACAAAATAAAGATGAGTACTGAACTTT 
:A^AGGGAAATTGCTTTTATTG 

StTTCCATTACTTCACCTTTAAAGAACCAGGTCATATACAATGAGATAAAAAGAAACTA 

©TCTGAAACATTCAGATGTAAACATCAATTCACTTGTTAGAAACCACCTTTGATCGCTA 

MAGACTAAATGCATACCTGTTTCAGAATGTGATAGAATGAAGACTTAAAAAAATTAAA 

S AGATAAATCCACCTACAACTATCAAATCACAAAATTAAACCNCACAACAAACTTGTAG 

fcATTCAAACTGGTAATAAAACACTGAGGAGCCTACCCAACTTTGAGGGGTGTCAATGG 

^GTNTTTTTAAATTTTTCGNGGGANANCCCAGTGNTATGGTGACCTTCACCCAAGAAGC 

HTGTTTGTTTNACCAAGCNAGGTTGNNCTNTGCTCC^ 

^AATNCTGGNTTTTTTNNGNGGCCCCCTNCNTTNNT 

lief 5.1 

Sequence of BAC4 using primer C2S6, which spans nucleotides 1686-1705 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1724-1736. 

TTCCTGGATAAG GTAATTGCTTTTACCCAACACAAATGTTTCTTATAATCAATGGATTT 

AGCCCAAAGTAAACGTACTTCATGTTCTAGTGCCTTTTAAGTGTGACCTTTTGTTTTTTT 

CTAAACCACCCGGCTGACCTGGAGTAGGTGATGAGAGCTTTAAGGTTGGGGCCCATTC 

CTTGAAGTGCTCTGATTCCTGTTTCCAGTACCTCAGATCCTGGGCAGGGTTTGCAGTGG 

AGCGTCTTGAGTGAATGGCTCTGGTGGGTTGAACGGGGAGGGACTCAAAATGCTGCCC 

ATCTCAATTTCCTGTAGTCTTTTTATTTATTTATTTATTTTTTGA 

GTCGCCCAGGCTGGAGTACAGCGGCACGATCTCAATTNACTGCAACCTCCGCCTCC:TG 

GGTTCAAACGACTCCTCTGCCTCAGCCTCCCCAGCAGCrTGGGACCACAGGCACAAGCC 

ACCACCGCCCGGCTAATTTTTTGTNTTTTTAGTA:GAGAT:GGGGTTTCACCATAT^ 

CAGGCTGGGCTCAAACTCCTGACC:TCGTCATCCGCWGCCTCGGNCTNCCAAAGTGCTT 

GGGATTNCAGGCNGTGAGCCCACTTACACCTNGGGCAATTCCCTGTNAGTCTTTTTTAC 

CAGAGACACCATCATTCAACACAGCTTTTCCACCCACAA 



Ref 5.2 

Sequence of BAC26 using primer C2S6, which spans nucleotides 1686-1705 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1712-1736. 

TG A G A AG AGC AATTTCCTGG ATAAG GTA ATTGCTTTT ACCCAAC ACAAATGTTTCTT AT 

AATCAATGGATTTAGCCCAAAGTAAACGTACTTCATGTTCTAGTGCCTTTTAAGTGTGA 

CCTTTTGTTTTTTTCTAAACCACCCGGCTGACCTGGAGTAGGTGATGAGAGCTTTAAGG 

TTGGGGCCCATTCCTTGAAGTGCTCTGATTCCTGTTTCCAGTACCTCAGATCCTGGGCA 

GGGTTTGCAGTGGAGCGTCTTGAGTGAATGGCTCTGGTGGGTTGAACGGGGAGGGACT 

CAAAATGCTGCCCATCTCAATTTCCTGTAGTCTTTT^^ 

AGAGTCTCGCTCTGTCGCCCAGGCTGGAGTACAGCGGCACGATCTCAATTCACTGCAA 

CCTCCGNCTCCCTGGGTTCAAACGACTCCTCTGNCTNAGNCTCCC:AGCAGCCTGGGAA 

CCACAGGCTCANGCCACCACGCCCGGCTAATTNTTGTAATTTTNAGTAANAAATTGGG 

GGTTCTCACCATNTTGGCCCAAGNCTTGGGCCTAAAAACCTTNCTNAGCNTCGNCATTC 

NCNCCCCNACCNTGGGCNCTNCTCAAANGNGCTTGGGGATTTANCANNGGCNTTAACC 

CCCCNTATCACCGTGGNCCTTAATTT 



Ref 6.1 

Sequence of BAC4 using primer C2S7, which spans nucleotides 1918-1937 of the cDNA. Exon 
sequence is not found within this sequence.Since the primer is directed against exon sequence we 
presume that sequence derived from C2S7 is intron sequence. 

pagngngggtttnagncgtttgaagcctgnnacgnggtgngtggtngaactctgtggg 

)#ittcaggtactggggtatctgggagcctgctgtttgcattgctagtgcatcagaccag 

ggctttttcctccctgtagctgctacttatacacatagctctaactgagatgattctcca 

feacaactgatgcagagcagcaaaagcttctgccgttctccccttctaggagtgtctcct 

Stctttggaaagagatcatgaggggctagattgtaatgaagtgaggctcagtgcttga 

fficacatccggtaaaagttccaatatattggtcataaagtttctcattctttatagcagt 

^aatttctctggctcatgagttttcttagttt^ 

scaccagtcatatccccagggcaaactcaaaggcatgagaggccagactcgggtcctg 

©TCATAGCAACCCCTGTCTAGGGCCTTGGTCCCTGCCTCCGCTTGTGTGCTGTGGCGCA 
GGTCCTATGGGCCCTTAGGAAACAGGACCACCCTGTCGCACCCCCTACAGAGACCAGC 
CAAGTTTGACATTAGATCACCGTAGCAATGTNTGCAAATTCCAGTTTCTTGCTAAAACA 
GGTTAAGCCTTGCAGCCACTTTATCTGTAACTGGCNGAGGTTTTGACATAAAA 

Ref 7.1 

Sequence of BAC4 using primer C2S8, which spans nucleotides 2143-2162 of the cDNA. Exon 
sequence is underlined and represents nucleotides 2182-2219. 

CTCTCGACACGCTGTTTCTATTAACATTGGCGTTTAAG GTTTGTATCAATTTGCTGTTCG 

NGGTTCTAGTTTTACCTTTCACATTCATTCTGCTTGGTAAGCTCAGTGAGCACAAACTTA 

CTATGTTGCATTTTTACTTCAGCAATTATTTTTGTCCCTGTAAGGAAACCATTAATCTTT 

AAATTCCTTTAATGAAATCATTCCACAGTGAATGGCTTGAATGCCCtGAAATAAAATTT 

AACTGGTCAGTGTGTGCTGCGCGCTTGGGTATGGTGGAAACACGGTCTCTGGAGGCAG 

TTAACTCTTGGCTCGAACCTTGAGGATGGTGAATATAGGCACCTAATCAGGCATTTCTG 

CCTTGAATATCTTTAAATATATCCAAATGTTATAGCGTTTAATTAGATTTTTATGTAGAA 

AGGAGCAATAAACACAAGACACATGTTTTCAGTTTTTTATCTGTTACTGCATTAAATGA 



TAAAAACGTTTTGG AG AT AGAAAATG AAAGGGGT lTlTlTlTl GTCTTGTTTT AAAGTT 
TTAGCAAATAATATTCAAGTAGGTGGAGATGGACTCTTCACCACTCTCCTGTTTTTAGG 
AACCCAATACTTTTTCATTCTTGCTAAATGATTACTTCCATTTCTAGCATAGAAAAGGA 

GAAAATTGGAATGAGTGTTTATAT 
Ref8.1 

Sequence of BAC4 using primer C2S9, which spans nucleotides 2992-301 1 of the cDNA. Exon 
sequence is not found within this sequence. Since the primer is directed against exon sequence we 
presume that sequence derived from C2S9 is intron sequence 

CGCTTTNAAATNCCAGCCGCTACTGCGGGGCG>rmAATTCGAAACGTGTTGTTNTCTGT 

GATGCCTGGCTCTGATTGTGTGGGATTGGTCATCAGTGGCGGTTGGCAGNTGGGGTTCA 

TGGAAGCGGCCATGGGGACTGATGGCAGGCCCTTGGATTGCCACCGCAGAGCCTGGCA 

GTGTCTTTGGTCTGCATTCCTACCGGCGAAGTCTCATTTCACCTCACGTGTTATCTCTTG 

GAAAGCATTCCTTTAGGGGGCTGTGTCTACCCTTCCATCCTCTCGTCCAAACTCCCCCTC 

CTTCTCTGTTGTGTCTCCTTCCCATCCTCTTCTCCCCAGTTCTTCTTCCTATGTTCCTTCCT 

CAGTGGTTTCTCTTCCTCTGTTTGACTTTCCAAGGTCATTTTGACTGTTCCTGCTCCCAA 

CTACAAAGATACTAAAATCTCACCTAACCACTCTTCTTCTTTCTTAATGAAAGAATGTT 

O'CAGTCCATCCCAAATTTGTGTGGACTTCACAAACCTTCrCTAAAATGGAGCCTTTTCT 

©rTCCTACTCTTGACTAGNTGGTAAACGCTCCATGTTCTTGGCCAGAACTCCCTGGTGA 

1TAGCGTCACTCCCACTTTCCTGTGCAGAACCAAGCCTCCTAGAAAACTCCTTTGCANC 

§|3AGTGGGTTGGGACACGCCCTTTmTTGGG 
JMef9.1 

Sequence of BAC4 using primer C2AS10, which spans nucleotides 3276-3295 of the cDNA. Exon 
sequence is underlined and represents nucleotides 3 147-3234. 

■TTTANACCNATNTATCCGNGTCAGTTANAGGAG TCTCTGAGAAATTTCCGACAGCGGT 
^TnAGTTTGGGTTCCTTGTAAATATACTCCTTTCCATCTTCATCTTCAAAGAATCCCT GT 

Sgacataaagcacaattagagctatccctgaacgtaagcccagggcttaccacctagga 
fagcgttcttttattacaagggggaaaaaaaggaatgggtctaaaaatccagctgaaat 
©ggctttctgaatgagaaagaaaatgctaataacatgaagtctaggtgcaaaggtaaa 
ffigaaaaacacaacattgcaaacttattcaagaatgcagtcattaagtgttgagtgaaa 
tgaaagattttggatacaagactaagctgtcccagggaagtctaatgggagtcaagcc 
tgtttcactttcccaagaagcagaactcactanaaaatgatgagcagcccacga cagg 
caggctcagaagtggacatgcctcccttctcctgatggctnccatgcacacaggatttt 
atggcatgaactgaagcgtttgggggtctggagtaagtttagtaaaagttaggtaaag 
cttgtataaattgtatttttgctttacccgatgagaaaaaaaatattnaagacctggta 
gcttcaatattcaagaaaaatatttttcatntcacccg 

ReflO.l 

Sequence of BAC4 using primer C2S 1 1 , which spans nucleotides 3 167-3 1 86 of the cDNA. Exon 
sequence is underlined and represents nucleotides 323 1-3296. 

ATTTGGTTCTTGAAAATGTCAAAATflATACAGGATTCTGGCAAG GTATT GACCAT GTTTG 
GANAAGTTTCATAGCAATGTAATGTTGTGATNCGATTACATATNATATATTTTTAAATG 
TNTATAGAAAAAAACACANGAAAAATATTAAGGATTGTTGGCCCGTGAGTGGCAGGTG 
TATNTTCTTNCTGATCCTTTAGNGCTTTCCATT^ 



TCGCCTAATTTTTGAAACATCTAATTTTACAAAATAATTAACCGTNTGGCCANGNATAT 

TNTCATTTTTAGGNCCAGCTATTTAGAAACTCTGACANAAATGAGGGGCTGTGGCTTNC 

CTNCCTNNACTTGNCCCTCTTTCMNGNATGTACCACATGAACTTGNCNCCTCTTTCNNC 

TNACCGGGTGGCATGTTANAGGACAGGTTGAAACCNCANTNGGGCNGGANTTNGGTN 

NAATTGGGACACAATGGTACNANGCTCTATNGGAATNGAAACTCTCCCNACNNNCNGT 

GNNCCNTGGGGAAAATGNGNCNNATTCATTTTN 

Refll.l 

Sequence of BAC4 using primer G2S12, which spans nucleotides 3474-3493 of the cDNA. Exon 
sequence is not found within this sequence. Since the primer is directed against exon sequence we 
presume that sequence derived from C2S9 is intron sequence 

AGNANNGTTNNGCAGCTGCANNTCTGGACCCANAGGCCGCANGGGCACGAGCCNGGA 

CACGCTCGGCAAAGAGCTGTCCAGAGGGATTCAGAAGCTTCAGGACTGGAAGGGTCTT 

TCGAGCTCAGTTAGCCACCCCCACACCCATTTCAGTTTCACATTTATCTAGTGCTTCCTT 

TTGAATACTTGGGATGTTTTTCTGTTGATCTGTTGGCACITCCTTCTTCCACAAGACCAG 

AAGCTCATATCCAATCTAAGGTCACTTACCCTTCTGAGAATCTGATGAAAATGGCGTGC 

CTTATGTGCCTAGATGCTTTTGCACACAGTCTAAGGTGACTTATGGAGTCCAGGTCCAG 

CAGCCACACCCAGTCCTGGGTCTCCGCACAGGGAGGGACCCGTCTTCACACACCTGTCT 

dAGGTTCTAGCATTGGGCTGCTTCAGCGGTCTCAGGCTGTGAGTAAATGGGATGTGAG 

©TTGGATCGCCCCACGCTGTTGNCCCCCGGGGGGCTTGGCCAGCTGGCCACTTNGAAAT 

©CCTCCTTTTGCCCAGGAAAGCTCACTGCATTTCAATGGGGNTTNTCCACGAAGTTCAN 

JfTTTANGGG 

tfief 12.1 

.Sequence of BAC4 using primer C2S13, which spans nucleotides 3645-3664 of the cDNA. Exon 
[sequence is underlined and represents nucleotides 3683-3699. 

(TN A A (IfrTTsTNCTC ANTN A ANN C A GCGTG AGNGTTC AGG TG AGCCAGGCAC AGC AGGC 
L£GGAGGGCAGCAGGGGACGTCCTTGCCCCTGGGTGACTTGAGAGTCGTTTCCACTAAC 
bAAGGTCTACTTGAGAGCCTCGGTTTACCAAGTGATCCCTGCTCCCTTCCCCCAACGTNT 
SjTGACATTTCTCCTGATATCAGAGGGGGAGGAAACCTCATGATCCCTGCCCCCCGCCCC 

s^tgaggactgactgtggggacaaagagccagatctcatagactaccctgatttgtcag 
tatttggggaattctgggtgcctgattagaagcatcaagactcttctaaatncaaaga 
agtgtggagagcagtagattttcctataaaactggtgttggtggtttctatgaaaattg 
gatccaaaaaaagtccttaagtttaccctcttaatggnatcttttgattaatggaattc 

ATTATTTTAATATAGCCCAATCAATCCAATTTTTCTTTATTGGTAGCATTTTTATGTTCTC 
TTTAAAAAAATCTTGGNCTACCrCCAAAATTTCACAGATGTTCTCCTAGGGTTTTCCTCC 
TTTTGGTTCAAGCATCCCATTCAANGTCTTGCAGTCCATTCTGGGG 

Ref 13.1 

Sequence of BAC4 using primer C2S14, which spans nucleotides 4289-4308 of the cDNA. Exon 
sequence is underlined and represents nucleotides 4321-4448. 

GAGTTANATTTATTCTTCCTTGCAGAGTAGTGTTAGAATAGATGGGCTACAGAAAAAAA 
AGGTTCTGGGATCTACATGGCAGGGAGGGCTGCACTGACATTGATGCCTGGGGGACCT 
TTTGCCTCGA GGCTGAGCTGGAAAATCTTGAAAATA'lTl'ril'l 1 Tl'CCTGTGGCACATTC 
AGGTTGAATACAAGAACTATTTTTGTGACTATGTTTTTGATGACCTAAGGGAACTGACC 
ATTGTAATTTTTGTACCANTGAACCANGAGATTTAAGTGCTTTTATATTCATTTCCTTGC 



ATTTAAGAAAATATGAAAGCTTAAGGAATTATGTGAGCTTAAAACTAGTCAAGCANTT 
TAGAACCAAAGGCCTATNTTNATAACCGCAACTATGCTNAAAAGNACAAAGTAGTACA 
GNATATTGNTATGTACATATCATTTGGTAATACACNCCNGGCNTTCTGTACATATATGT 
ATTACATTTCTACNTTTTTAATACTGCCNTGGGCTTATGCCNTTAAGGTTAANTTGNGAT 
AAATTTN GGCTGTTCCNGTNTATNCNATACNCTTTT 



Sequence of BAC4 using primer C2AS15, which spans nucleotides 4680-4700 of the cDNA. Exon 
sequence is underlined and represents nucleotides 4660-4683. 

ATGAGAATGTAAT ACATATATGTACAGAATGCCAGGACTGTATTAACAATGATATGTA 
CATAACAATATACTGTACTACTTTGTACTTTTCAGCATAGTTGCGGTTATTAATATAGG 
CCTTTGGTTCTAAACTGCTTGACTAGTTTTAAGCTCACATAATTCCTTAAGCTTTCATAT 
TTTCTTAAATGCAAGGAAATGAATATAAAAGCACTAAATCTCCTGGTTCACTGGTACAA 
AAATTACAATGGTCAGTTGCCTTAGGTCATCAAAAACTAGTCACAAAAATAGTTCTTGT 
ATTCAACCTGAATGTGCCACAGGAAAAAAAAAATATTTTCAAGATTTTCCAGCTCAGC 
CTCGAGGCAAAAGGCCCCCAGGCATCAATGTCAGNGCAGCCCTCCTGCCATGTAGATC 
CCAGAACCTTTTTTTTCTGTAGGCCATCTATTCTAACACTACTCTGCAGGGAGAATAAA 
g^TCTAAAGNCCAGCTCAAGAGTGCTACCACACCTTTGTTAAGACACAATGAAAACTTT 
JiGATATTGGCAGGNGAGATTTAAAAAAAAATGTGCCCTTTCTTACCACTCCTATAGNA 
ykAGTCTGGTTAAGAAATAACCGTTGGTCTTTATTTTCCTTTTNTTTCCCCTTCCC 
MSICTTCCTGGGGCTCGG 



Ref 14.1 




HC2A 

KIAA ASGNLDKN ARFSAI YRQDSNKLSN DDMLKLLAD FRKPEKMAKLPV I LGNLD I T I DNVS S D 

rat ; 

HC4 

HC1 : 

HC3 

HC5 



HC2A 

KIAA FPNYVNSSYIPTKQFETCSKTPITFEVEEFVPCIPKHTQPYTIYTNHLYVYPKYLKYDSQ 

rat ■ 

HC4 

HC1 

HC3 

HC5 

HC2A VLHHHQN PEFYDE I K 

KIAA KS FAKARN IAICIEFKDSDEEDSQPLKC I YGRPGGPVFTRSAFAAVLHHHQN PEFYDE I K 

rat 

HC4 



Jfc3 : 

W&5 

J 3 = 

flgC2A IELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDWETQVGYSWLPLLKDGRWTSEQHI 

j]§IAA IELPTQLHEKHHLLLTFFHVSCDHSSKGSTKKRDWETQVGYSWLPLLKDGRWTSEQHI 

uSat 

3C4 

fgci 

gfic3 

Q§C2A PVSANLPSGYLGYQEIX^GRHYGPEIKWVTCGKPLLKISTHLVSTVYTQDQHLHNFFQYC 

^IAA PVSANLPSGYLGYQEI^GRHYGPEIKWVDGGKPLLKISTHLVSTVYTQDQHLHNFFQYC 

"rat * 

HC4 

HC1 

HC3 GPGPARSTVS I SLI SNSARV 

HC5 " 

HC2A QKT ESGAQALGNELVKYLKSLHAMEGHVM IAFLPT I LNQLFRVLT - RATQEEVAVNVTRV 

KIAA QKTESGAQALGNELVKYLKSLHAMEGHVMIAFLPT I LNQLFRVLT- RATQEEVAVNVTRV 

rat 

HC4 MEIQVLIRFLSVILMQLFWVLPNMIHEDDVPISCPMV 

HC1 MSFLPII LNQLFKVLV-QNEEDE I TTTVTRV 

HC3 NRSRSLSNSNPDISGTPTSPDDEVRSIIGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAES 

HC5 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 





Kef 



HC2A IIHWAQCH^«ESHLRSyVKYAYKAEPYVASEyKTVHEEl(^MSTTILKPSADFLTSN 

KIAA 1 1 HWAQCHi^fesHLRS YVKYAYKAEPYVASEYKTVHEeSKmTTILKPSADFLTSN 

rat 

HC4 LraiVSKCHEEGLDSYLSSFIKYSFRPGKPSAPQAPLIHETIATMMIALLKQSADFLAIN 

HC1 LP DI VAKCHEEQLDHS VQS Y I KFVFKTR ACKERPVHEDLAKNVTGLLK- SN DS PTVK 

HC3 TQAMDRSCNRMSSHTETSSFLQTLTGRLP TKKLFHEELALQWWCSG — SVR E 

HC5 



Cadherin 



KLLR Y S W FFFDVL I K SMAQHL I EN S KVKLI 
KLLKYSWFFFDVLI KSMAQHLI EN S KVKLI 


RNQR 
RNQR 


fPASYHHAAETWNMLMPHITQKFGD 
?PAS YHHAVET WNMLMPH I TQKFRD 


KLLKYSWFFFEI IAKSMATYLLEENKIKL1 
HVLl^iSWFFFAI ILKSMAQHLI DTNKIQLE 

salqqawfffelm|/ksmvhhl Y FNDKLEAF 


HGQR 
RPQR 

RKSR 



TPKAYHHALHSLFLAIT-IVESQYAE 
rPESYQNELDNLVMVLSDHVIWKYKD 
rPERFMDDI AAL VS T IAS DI VS RFQK 



6 I. 



HC2A NPEASKNANHSLAVFI KRC FT FWDRGFVFKQIN NY-IS — CFAPGDe|kTLFEYKFEFL % • I 

KIAA NPEASKNANHSLAVFIKRCFTFMDRGFVFKQIN NYIS — CFAPGDPKTLFEYKFEFL 

rat 

HC4 IPKESRNVNYSLASFLKCCLTLMDRGFVFNLIN DYIS — GFSPKDPKVLAEYKFEFL 

HC1 ALEETRRATH S VARFLltlC FT FMDRGCVFKMVN NYIS — MFSSGDLKTLCQYKFDFL 1 • I 

HC3 DTEMVERLNTSIAFFLNDLLSVMDRGFVFSLIKSCYKQVSSIO,YSLPNPSVl|/SLRLDFL 3.1 / 3-2 

HC5 



HG2A RWCNHEHYIPLNLPM PFGKGRIQR YQDLQL DYSLTDEF 

KIAA RWCNHEHYIPLNLPM PFGKGRIQR YQDLQL DYSLTDEF 

r^t , 

fip4 QT I CNHEHY I PLNL PM AFAKPKLQR VQDSNL E Y S LS DE Y 

gCl QEVCQHEHFI PLCLPIRSANI PDPLTPSES TQELHASDMPEYSVTNEF 

fiC3 RIICSHEHYVTLNLPCSLLTPPASPSPSVSSAljSQSSGFSTNVQDQKIANMFELS — VPF I / 

HC5 MNADTAPTSPCPSIS SQNSSSCSSFQDQKIASMFDRTSRVPA 

St2A CRNH FLVGLLLREVGTALQE FRE — — VRLI AI S VLKNLL I KHS FDDRYASpjsHQARI AT 3- I 

^EAA CRNHFLVGLLLREVGTALQEFRE VRLI AI S VLKNLL I KHS FDDRYAS RS HQAR I AT 

i^at 



CKHHFLVGLLLRETSIALQDNYE IRYTAISVIKNLLIKHAFDTRYQHKNQQAKIAQ 

gpl CRKHFLljGILLREVGFALQEDQD VRHLALAVLKNLMAKHS FDDRYRE PRKQAQ IAS 9- \ 

HC3 RQQH YLAGLVLTELAVI LDPDAEGLFGI*HKKVINMVHNLLS SHDS DPRYS DPQ I KARVAM 

HC5 SSTS-SPGLLFTELAAALDAEGEGISEVQRKAVSAIHSIJ»SSHDLDPRCVKPEVKVKIAA 

HC2A LYLPLFGLLIENVQRINVRDVSPFPVNAG-MTVKDESIJHjPAV^ 

KIAA LYLPLFGIiIENVQRINVRDVSPPP\WW^-MTVKDESIJ^ 

rat - 

HC4 LYLPFVGLLLENIQRLAGRDTLYSCAAMPNSASRDEFPCG FTSP — AN — RGSLS 

HC1 LYMPLYGMLLDNMPRI YIJCDLYPFTVNTSN(fcSRDD3*STNGGFQSQTAIKHANSVDTS FS <\ . / 

HC3 LYLPLIG I IMETVPQLY DFTETHNQRGRP I CI AT DDYESE SG SMIS 

HC5 LYLPLVGIILDALPQLCDFTVADTRRYR TSGSDEEQE GA GAIT 

HC2A kdllgaisgJaspyttstpninsvrnadsrgslistdsgnslpernseksnsldk|iqqss s I / S 

kiaa kdllgaisgiaspyttstpninsvrnadsrgslistdsgnslpernseksnsldkhqqss 

rat - — — - 

HC4 TDKDTAYGSFQNG — HGIKREDSRGSLI P-EGATGFPDQGNTGEN TRQS 

HC1 KDVLN S I AfcVFS S IAISTVNHADSRASLASLDSNPSTNEKSSEKTDNCEKI PRPL | O . » 

HC3 QTVAMAI AGT S VPQ LTRPGSFLLTSJtSGRQHT 

HC5 QNVALAI AGNN FN LKTSG- I VLS S|LP YKQ YN ^ j 



HC2A TLGNSVVRCDKLDQSEIKSLLMCFLYILKSMSDDALFTYWN-KASTSELMDFFTISEVCL 

KIAA TLGNSVVRCDKLI>3SEIKSLI^FLYILKSMSDDALFTYWN-KASTSEU4DFFTISEVCL 

rat ^ 

HC4 STRSSVSQYNRLDQYEIMLI^CYLYIVKMISEDTLLTYWN-KVSPQELINILILLEVCL 

HC1 ALIGSTLRFDRLDQAETRSLLMCFLHIMKTISYETLIAYWQ-RAPSPEVSDFFSIlbvCL M- I / II. 2. 

HC3 T FS AE S S RS LL I CLLWVLKN - ADET VLQKWFT DLS VLQLNRLLDLL YLC V 

HC5 MLNADTTRNLMICFLWIMKN-ADQSLIRKWIADLPSTQLNRILDLLFICV 



HC2A HQF|QYMGKRYIARNQEGLG — PIVHDRKS QTLPVSRNRTGMM i 

KIAA HQ FQ YMGKR Y I AR TGMM 

rat 

HC 4 FH FRYMGKRN IARVH DAWLS KHFG I DRKS QTMPALRNRSG VM 

HC1 QNFRYLGKRNIIRKIAAAF — KFVQSTQNNGTLKGSNPSCQTSGLLAQWMHSTSRHEGHK 

HC3 SCFEYKGKKVFERMNSLTFK — KSKDMRAK LEEAILGSIGARQEMV 

HC 5 LCFE YKGKQSS DKVSTQVLQ- -KSRDVKAR LEEALLRGEGARGEMM 



HC2A HARLQQL GSLDNS LT FNHS YGHSDADVLHQSLLEAN IATE VC 

KIAA HARLQQL GSLDNS LTFNHSYGHSDADVLHQSLLEANIATEVC 

rat 

HC4 QARLQHL SSLESS FTLNHSSTTTEADI FHQALLEGNTATEVS 

HC1 QHRSQTI£IIRGK NALSNPKL LQMLDNTmfeNSNEIDIVHHVDTEANIATEGC fZ./ / I Z . 2_ 

Kf3 RRS RGQip RS P S G S A FG S QENLRWRKDMTHWRQN T EKLDKS RAE I EH EAL I DGNLAT EAN / 6 ^ 

HC 5 RRRAPGNDRFP GLNENLRWKKEQTHWRQANEKLDKTKAEL DQEALI SGNLATEAH 



4§2A LTALDTLSLFTLAFK^QLIADHGHNPLMKKVFDVYLCFLQKHQSETALBCNVFTALRSLIY ~1 - I 

KIAA LTALDTLSLFTLAFKNQLLADHGHNPLMKKVFDVYLCFLQKHQSETALKNVFTALRSLIY 

KLSRGHSPIMKKVFDVYLCFLQKHQSEMALKNVFTALRSLIY 

H^? 4 LTVLDTISFFTQCFKTHFLNNDGHNPLMKKVFDIHIAFLKNGQSEVSLKHVFASLRAFIS 

LTILDLVSLFTQTHQRQUX^DCQNSIJ«KRGFDTYMLFFQVNQSATALK3JVFASLRLFVC| I 3 . 1 
HC3 LI ILDTLEI WQTVS — VTES- -KES ILGGVItfCVLLHSMACNQSAVYLQHCFATQRALVS 

LIILDMQENIIQASS — ALDC — KDSLLGG\n^RVLVNSLNCDQSTTYLTHCFAjrLRALIA ? / 



H€5 



H?2A KFPSTH^EGRADMCAALCYEILKCCNSKLSSIRTEASQLLYFIMRNNFDYTGKKSFVRTH 

ffAA KFPSTFYEGRADMCAALCYEILKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTH 

KFPSTFYEGRADMCASI^YEVLKCCNSIO^SSIRTEASQLLYFI^^ 

HC4 KFPSAFFKGRVNMCAAFCYEVLKCCTSKISSTRNEASALLYLLMRNNFEYTIO^ 

HC1 KFPSAFFQGPADLCGSFCYEVLKCCNHRSRSTQTEASALLYLEW 

HC3 I^PELLFEEETEQCADLCLRLLRHCSSSIGTIRSHPSASLYLLMRQNFEIGN — NFARVK /l-2- 

HC5 KFGDLLFEEEVEQCFDLCHQVLHHCSSSMDVTRSQACATLYLLMRFSFGATS — NFARVK 



HC2A LQVI I S VSQL I ADWG I GETRFQQSLS I INNCAN S DRLI KHTS FS S DVKDLTKRI RTVU4 

KIAA LQVI I S VSQLI ADWGI GGTRFQQSLS I INNCAN S DRL I KHTSFS S DVKDLTKRI RTVLM 

rat LQVI I SLSQLI ADWG I GGTRFQQSLS 1 1 NNCANS DRLI KHTS FS S DVKDLTKRI RTVLM 

HC4 LQI I IAVSQLIADVALSGGSRFQESLFIINNFANSDRPMLARAFPAEVKDLTKRIRTVLM , 

HC1 LQJJ I KAVSQL I AD- AG I GGS RFQHSLAI TNN FANGDKQMKti|5N FPAEVKDLTKRI RTVLM /If .)/|4- tj \g~J I \S 

HC3 MQVPMSLSSLVGTSQNFNEEFLRRSLKTILTYAEEDLELRETTFPDQVQDLVFNLHMILS ' 

HC5 MQVTMSLASLVGRAPDFNEEHLRRSLRTILAYSEEDTAMQMTPFPTQVEELLCNLNSILY 





R«f. 



Transmembrane 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



ATAQMKEHENDPEMLVDLQYSLAKSYASTPELRKTWLDSMARIHVKNG1 
ATAQMKEHENDPEMLVDLQYSLAKSYASTPELRKTWLDSMJ^IHVKNGI 
ATAQMKEHEN DPEMLV DLQ YSLAKS YAST PELRKTWLDSMARI HVKNG1 
ATAQMKEHEKDPEMLI DLQY SLAKS YAST PELRKTWLDSMARI HVKNG1 
AT AQMKEHEKDPEMLV DLQY SLAN S YAST PELRRTWLESMAKI HARNG1 
DTVKMKEHQEDPEMLIDLMYRIAKGYQTSPDLRLTWLQNMAGKHSERS1 
DTVKMREFQEDPEMLMDLMYRIAKSYQASPDLRLTWLQNMAEKHTKKK< 



»SEAAMCYVHV 
.SEAAMCYVHV 
.SEAAMCYVHV 
'SEAAMCYVHV 
lE^AMCYIHI 
lQCLVHS 
TEAAMCLVHA 



SH3 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



TALVAEYI TRKGV 

TALVAEYI TRKEA 

TALVAEYI TRKEAD 
AALVAEFI HRKKL 



FRQGCTAFRVITPN 

— VQWEPPLLPHSHSACLRRfeRGGVFRQGCTAFRVITPN 
LALQREPPVFPYSHTSCQRK 3RGGMFRQGCTAFRVITPN 

—FPNGCSAFKKITPN 

AALIAEYflKRKGYWKVEKldTASLLSEDTHPCDSNSLLTTP 5GGSMFSMGWPAFLS ITPN 

AALVAEYIjSMLED f™ RKYLPVGCVTFQ^ISSN | ( ? • X 

— HSYLPVGSVSFQNISSN 



HC2A IDEEASMMEDVG^IQD- VHFNEDVLMELLEQCADGLWKAERYELIADI YKLI IPI <P * [ 

KIAA IDEJlASMMEDVGMQD VHFNEDVIJffiLLEQCADGLWKAERYELIADI YKLI IPI 

ra t I D^^SMMEDVGMQD- VHFNEDVLMELLEQCADGLWKAERLRAGLLTSINSSS P 

HC4 I DEEGAMKEDAGMMD VHYSEEVLLELLEQCVNGLWKAERYEI I SEI SKLIGPI 

HQ. I KEEGAAKEDSGMHD T P YNE|N I LVEQL YMCGE FLWKSERYELI ADVNKP 1 1 AV | "I ./ / IT- 

HCB VLEESAVS DDWS PDEEG I C SGKYFTESGLVGLLEQAAAS FSMAGMYEAVNEVYKVLI P I 

l&S VLEESWSEDTLS PDE DGVCAGQYFTESGLVGLLEQAAEL FSTGGLYETVNEVYKLV IPI 

5 '6 S 

Q ITAM ITAM ITAM ITAM 

tp2A YEKPfctD h h" i f~ " ~ T % ' 

Kf AA YEKRRDFERLAHL f DTL 1R2 YSKV ?EVMHSGRRLLGT frFRM AFFGQAAQYQFTDSETDVE 

rijt SMKSGGTLETTHL f DTL IRE YSKV *EVZ TR A AGSWDLLPGGLFGQ 

HC4 YENRREFENLTQV f RTL 1G? YTKI jEVMHTKKRLLG TFjERiZAFYGQ 

HC1 FEKQRDFKKLSDL fYDI iR£ YLKV ^EWNSEKRLFG R fYRV \FYGQ 

«e3 H ET^iRDAKKLST I *GKL JE? FSKI THC^TGWERMFG T fFRV 5FYG- ^ , \ 

aC5 LEAHREFRKLTLT iSKL JRf FDSI 7NKDH — KRMFG TflTRVEFFG- 



hS2A -FFEDEDGKEYIYKEPKLTPLSEISQRIJjKLYSDKFGSENVKMIQDSGI^NPKDLDSKYA lo.\ 

giAA gffe:dedgkeyiykepkltplseisqrllklysdkfgsenvkmiqdsgkvnpkdldskya 

fit gffededgbobyi ykepkltplsei sqrllklys dkfgsenvkmiqdsgkvnpkdldskfa 

HC4 sfeeeedgkeyiykepkltglseislrlvklygekfgtenvki iqdsdkvnakeldpkya 

HC1 GF^i^USGKEYl YKEPKLTGLSEI SQRLLKLYADKFGADNVKI IQDSNKVNPKDLDPKYA 

HC3 TKifeliLliBQjEFVYK^ ( KO-Z. 

HC5 SKFGDLD^EfyYKEPAITKLPEISHRLm 4-. | 

HC2A YI<pi^iPFf©^ / ,K 2 - 

KIAA Y I QVTHViP^ I RRFMFEMPFTQTGKRQGGVEEQGKRRT I LT A 

rat Y I QWHVTPFFDEKE^ERKTEFERCHN I RRFMFEMPFTQTGKROGGVBEQGW^RT ILTA 

HC4 HIQVTYVKPYFDDKELTERKTEFERNHNISRFVFEAPYTLSGKKQGCIEEQCK^ 

HC1 YIQVTYVTPFFEEKEIEDRKTDFEMHHNINRFVFETPFTLSGIOCHGGVAEQCKRRTILm \8- I 

HC3 Y I QITYVEP YFDT YEMKDRI T YFDKNYNLRRFMYCT PFTL DGRAHGELHEQFKRKT I LIT 

HC5 Y IQIT FVEPY FDEYS4KDRVTYFEKN FNLRREWYTT PFTLEGRPRGELHEQYRRNTVLTT 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 




Coiled-Co 




I HC FP YVKKR^PfMYQHHT DLN E I EVAI DEMSKKVAELRQLC U W iL VDM I KLQLKLQGfe V 
IHCFPYVKKRIPVMYQHHTDLNE IEVAIDEMSKKVAELRQLCSSAEVDMIKLQLKLQG 3V 
I HC FPYVKKRI PVMYQHHT DLN E I EVAI DEMSKKVAELHQLCS SAEVDMI KLQLKLQG 3V 
SNSFPYVKKRIPINCEQQINLKI IDGATDEIKDKTAELQKLCSSTDVDMIQLQLKLQG *V 
SHL FPYVKKRI QV I SQSSTELNE I EVAI DEMSRKVSELNQLCTMEEVCMI SLQLKLQG >V 
SHAFPYIKTRVNVTHKE^IILTE I EVAI EDMQKKTQELAFATHQDPADPKMLQMVLQG 3V 
MHAFPYIKTRISVIQKEEFVLTE I EVAI E DMKKKTLQLAVAI NQE P P DAKMliQMVLQG 3V 



Coiled-Coil 2 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HC1 

HG3 

Ms 



t$€2A 
KtAA 



HC4 

MCI 
Hb3 

HC5 



SVC^AGPIAYARAFLDDTNTKRYPDNKVKLLKE^ 
SVQVHhG PLAYARAFLDDTNTKRY P DNKVKLLKEVFRQFVEACGQA ^AVNERLIKEDQLE 
SVQVNAGPLAYARAFLDDTNTKRYPDNKVKLLKEVFRQFVEACGQA -AVNERLIKEDQLE 
SVQVNAGPLAYARAFLNDSQASKYPPKKVSELKDMFRKFIQACSIA jELNERL I KE DQVE 
SVKVNAGPMAYARAFLEETNAKKYPDNQVKLLKEI FRQFADACGQA jDVNERLIKEDQIjE 
GTT VNQG PLE VAQ VFLS E I PSDPKLFRHHNKLRLCFKDFTKRCEDA jRKN KSI* I G P VQKE 
GATVNQGPLEVAQVFLAEI PADPKLYRHHNKLRLCFKEFIMRCGEA /EKNKRLITADQRE 



Coiled-Coil 2 



YQEEMKANYREMAKELSEIMHEQI -PLEEKTS-VLPNSLHIFNAISGTPTSTMVHGMTSS 

YQEEMKANYREMAKELSEIMHEQIi 3 

YQEEMKANYREIRKELSDIIVPRI ^PGEDKRATKFPAHIiQRHQRDTNKHSGSRVDQFILS 
YHEGLKSNFRDMVKELSDIIHEQI jQEDTMHS PWMSNTLHVFCAI SGT S S DRGFcSfc PR YA 

YQEELRSHYKDMLSELSTVhfaEQI TGRDDLSK RGVDQTCTRV I SKAT PALPTVS I S S 

YQRELG KLSS ?Z— 

YQQELKKN YNKLKENLRPMI ERKI PELYKP I FRVESQKRDS FHRS S FRKCETQLSQG S Z - 



PBM 



CVTLPHEPHVGTCFVMCKLRTTFRANHWFCQAQEEAMGNGREKEPWTVIFNSRFYRSWGK 



SAEV 



I'M 



mc2A 

tldlAA 
gat 

HC1 
HC3 
HC5 



VHIFF 





-O 



4. 




',.:ft.-V:-."..x-:;:.. 

-mi; 




~ 4.5 kb 



1 .85 kb 



m 

If 



W0 



m 



IBte 



- 4.5 kb 



1 .85 kb 



^5 




genomic DNA 

FW|.7 



BAC 6 DNA 




59 kD 
57 kD 



Fldi. & 



none GST alone TIAM-1 PSD95 DLG1 neDLG 

GST-PDZ fusion protein 



f)t\. °ik 



a> 



to 

£ 

fx 

CD 



O 
CO 



1.2 r 



DLG1 
NeDLG 

PSD95 



0.8 - 



0.6 



0.4 - 



0.2 - 




10 15 20 25 

Clasp 2 peptide (uM) 



t 



in 



Q 
O 



0.35 r 

0.3 
0.25 

0.2 
0.15 

0.1 



0.05 
0 



: I 



□ 5 uM CLASP2 

5 uM CLASP2 + 100 uM CLASP2 Inhibitor 
5 uM CLASP2 + 100uM KV1.3 Inhibitor 



j i — i 




■ 



1 



-I L 




PSD95 



NeDLG 



PDZ Protein 



FMj. <\L 




PSD95 NeDLG 

PDZ Protein 



I 10 I 20 | 30 | 40 

1 AATTGTAATA CGACTCACTA TAGGGCGAAT TGGGTACC6G 
81 GAATTCGGCA CGAGTTTTAC ACCATCACCA AAACCCAGAA 
161 ATGAAAAGCA CCACCTGTTG CTCACATTCT TCCATGTCAG 
241 GTCGTTGAAA CCCAAGTTGG CTACTCCTGG CTTCCCCTCC 
321 GGTCTCGGCG AACCTTCCTT CGGGCTATCT TGGCTACCAA 
401 GGGTAGATGG AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
481 ATTTTTTCCA GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
561 CATGCGATGG AAGGCCACGT GATGATCGCC TTCTTGCCCA 
641 ACAGGAAGAA GTCGCGGTTA ACGTGACTCG GGTCATTATT 
721 ACTTGAGGTC ATATGTTAAG TACGCGTATA AGGCTGAGCC 
801 ACCAAATCCA TGACCACGAT TCTCAAGCCT TCTGCCGATT 
881 CTTTGATGTA CTGATCAAAT CTATGGCTCA GCATTTGATA 
961 CTGCATCCTA TCATCATGCA GCGGAAACCG TTGTAAATAT 
1041 GAGGCATCTA AGAACGCGAA TCATAGCCTT GCTGTCTTCA 
1121 CAAGCAGATC AACAACTACA TTAGCTGTTT TGCTCCTGGA 
1201 GTGTAGTGTG CAACCATGAA CATTATATTC CGTTGAACTT 
1281 GACCTCCAGC TTGACTACTC ATTAACAGAT GAGTTCTGCA 
13 61 GACAGCCCTC CAGGAGTTCC GGGAGGTCCG TCTGATCGCC 
1441 ATGACAGATA TGCTTCAAGG AGCCATCAGG CAAGGATAGC 
1521 GTCCAGCGGA TCAATGTGAG GGATGTGTCA CCCTTCCCTG 
iCOl ACCAGCTGTG AATCCGCTGG TGACGCCGCA GAAGGGAAGC 
l^ei TCTCCGGCAT TGCTTCTCCA TATACAACCT CAACTCCAAA 
ATAAGCACAG ATTCGGGTAA CAGCCTTCCA GAAAGGAATA 
1641 CACATTGGGA AATTCCGTGG TTCGCTGTGA TAAACTTGAC 
1&2S1 TCTTAAAGAG CATGTCTGAT GATGCTTTGT TTACATATTG 
20$ 1 ATATCTGAAG TCTGCCTGCA CCAGTTCCAG TACATGGGGA 
2 Ml AGTTCATGAT CGAAAGTCTC AGACATTGCC TGTTTCCCGT 
2j}l GCAGCCTGGA TAACTCTCTC ACTTTTAACC ACAGCTATGG 
2241 GCCAACATTG CTACTGAGGT TTGCCTGACA GCTCTGGACA 
i^Til GGCCGACCAT GGACATAATC CTCTCATGAA AAAAGTTTTT 
2\bl CGGCTTTAAA AAATGTCTTC ACTGCCTTAA GGTCCTTAAT 
24 81 ATGTGTGCGG CTCTGTGTTA CGAGATTCTC AAGTGCTGTA 
1 561 GCTCTACTTC CTGATGAGGA ACAACTTTGA TTACACTGGA 
£#41 CTGTCAGCCA GCTGATAGCA GACGTTGTTG GCATTGGGGA 
GCCAACAGTG ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
2801 AATGGCCACC GCCCAGATGA AGGAGCATGA GAACGACCCA 
2881 ATGCCAGCAC GCCCGAGCTC AGGAAGACGT GGCTCGACAG 
gf 61 GCAGCAATGT GCTATGTCCA CGTAACAGCC CTAGTGGCAG 
3441 CGCCTTCAGG GTCATTACCC CAAACATCGA CGAGGAGGCC 
]g21 ACGAGGATGT GCTGATGGAG CTCCTTGAGC AGTGCGCAGA 
=3^01 ATCTACAAAC TTATCATCCC CATTTATGAG AAGCGGAGGG 
3281 GGAACCCAAA CTCACACCGC TGTCGGAAAT TTCTCAGAGA 
3361 TCAAAATGAT ACAGGATTCT GGCAAGGTCA ACCCTAAGGA 
3441 ATCCCCTTCT TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
3521 TGAGATGCCA TTTACGCAGA CCGGGAAGAG GCAGGGCGGG 
3601 TACACTGCTT CCCTTATGTG AAGAAGCGCA TCCCTGTCAT 
3681 ATTGACGAGA TGAGTAAGAA GGTGGCGGAG CTCCGGCAGC 
3761 CAAACTCCAG GGCAGCGTGA GTGTTCAGGT CAATGCTGGC 
3841 CAAAGCGATA TCCTGACAAT AAAGTGAAGC TGCTTAAGGA 
3921 GCGGTAAACG AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
4001 GGAGCTTTCT GAAATCATGC ATGAGCAGAT CTGCCCCCTG 
4 081 TCAACGCCAT CAGTGGGACT CCAACAAGCA CAATGGTTCA 
4161 ATGGCCCGTG TGTGGGGACT TGCTTTGTCA TTTGCAAACT 
4241 CAGGGAGGAC CAAGGGGAAG GGGAGAGAAA GGAAATAAAG 
4321 AGAAGGTGCA CATATTTTTT TAAATCTCAC TGGCAATATT 
4401 TCTTGAGCTG GACTTAGATT TTATTCTTCC TTGCAGAGTA 
44B1 ATCTACATGG CAGGGAGGGC TGCACTGACA TTGATGCCTG 
4 561 AATCAGGGTA CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
4 641 ATAGAGCAAG AATAGTGAGC TAACTGAGCT AGACACTCAA 
4721 TCATCGACTC CGGGACGGTC ATATATGTAT TACATTTCTA 
4 801 TGTGATAAAT TTGTGCTGGT CCAGTATATG CAATACACTT 
4 881 TGTATACAAG TCTTTACT 

I 10 I 20 I 30 I 40 



* 



I 50 I 60 I 70 I 80 

GCCC CCCCTC GAGGTCGACG GTATCGATAA GCTTGATATC 80 
TTTTA TGATG AGATTAAAAT AGAGTTGCCC ACTCAGCTGC 160 
CTGTGACAAC TCAAGTAAAG GAAGCACGAA GAAGAGGGAT 240 
TGAAAGACGG AAGGGTGGTG ACAAGCGAGC AGCACATCCC 320 
GAGCTTGGGA TGGGCAGGCA TTATGGTCCG GAAATTAAAT 400 
TCTGGTTTCT ACAGGGATAC TCAGGATCAG CATTTACATA 4 80 
AGCCTTAGGA AACGAACTTG TAAAGTACCT TAAGAGTCTG 560 
CTATCCTAAA CCAGCTGTTC CGAGTCCTCA CCAGAGCCAC 640 
CATGTGGTTG CCCAGTGCCA TGAGGAAGGA TTGGAGAGCC 720 
ATATGTTGCC TCTGAATACA AGACAGTGCA TGAAGAACTG 800 
TCCTCACCAG CAACAAACTA CTGAGGTACT CATGGTTTTT 880 
GAGAACTCCA AAGTTAAGTT GCTGCGAAAC CAGAGATTTC 960 
GCTGATGCCA CACATCACTC AGAAGTTTGG AGATAATCCA 1040 
TCAAGAGATG TTTCACCTTC ATGGACAGGG GCTTTGTCTT 1120 
GACCCAAAGA CCCTCTTTGA ATACAAGTTT GAATTTCTCC 1200 
ACCAATGCCA TTTGGAAAAG GCAGGATTCA AAGATACCAA 1280 
GAAACCACTT CTTGGTGGGA CTGTTACTGA GGGAGGTGGG 1360 
ATCAGTGTGC TCAAGAACCT GCTGATAAAG CATTCTTTTG 144 0 
CACCCTCTAC CTGCCTCTGT TTGGTCTGCT GATTGAAAAC 1520 
TGAACGCGGG CATGACCGTG AAGGATGAAT CCCTGGCTCT 1600 
ACCCTGGACA ACAGCCTGCA CAAGGACCTG CTGGGCGCCA 1680 
CATCAACAGT GTGAGAAATG CTGATTCGAG AGGATCTCTC 1760 
GTGAGAAGAG CAATTCCCTG GATAAGCACC AACAAAGTAG 1840 
CAGTCTGAGA TTAAGAGCCT ACTGATGTGT TTCCTCTACA 1920 
GAACAAGGCT TCAACATCTG AACTTATGGA TTTTTTTACA 2000 
AGCGATACAT AGCCAGGAAC CAGGAGGGGT TGGGACCCAT 2080 
AACAGAACAG GAATGATGCA TGCCAGATTG CAGCAGCTGG 2160 
CC ACTCG GAC GCAGATGTTC TGCACCAGTC ATTACTTGAA 224 0 
CGCTTTCTCT ATTTACATTG GCGTTTAAGA ACCAGCTCCT 2320 
GATGTC TACC TGTGTTTTCT TCAAAAACAT CAGTCTGAAA 24 00 
TTATAAGTTT CCCTCAACAT TCTATGAAGG GAGAGCGGAC 24 80 
ACTCCAAGCT GAGCTCCATC AGGACGGAGG CCTCCCAGCT 2560 
AAGAAGTCCT TTGTCCGGAC ACATTTGCAA GTCATCATAT 2640 
AACCAGATTC CAGCAGTCCC TGTCCATCAT CAACAACTGT 2720 
CTGATGTGAA GGACTTAACC AAAAGGATAC GCACGGTGCT 2800 
GAGATGCTGG TGGACCTCCA GTACAGCCTG GCCAAATCCT 2880 
CATGGCCAGG ATCCATGTCA AAAATGGCGA TCTCTCAGAG 2960 
AATATCTCAC ACGGAAAGGC GTGTTTAGAC AAGGATGCAC 3040 
TCCATGATGG AAGACGTGGG GATGCAGGAT GTCCATTTCA 3120 
TGGACTCT GG AAAGCCGAGC GCTACGAGCT CATCGCCGAC 3200 
ATTTCTTTGA AGATGAAGAT GGAAAGGAGT ATATTTACAA 3280 
CTCCTTAAAC TGTACTCGGA TAAATTTGGT TCTGAAAATG 3360 
TCTG GATT CT AAGTATGCAT ACATCCAGGT GACTCACGTC 3440 
CAGAGTTTGA GAGATCCCAC AACATCCGCC GCTTCATGTT 3520 
GTGGAAGAGC AGTGCAAACG GCGCACCATC CTGACAGCCA 3600 
GTACCAGCAC CACACTGACC TGAACCCCAT CGAGGTGGCC 3680 
TGTGCTCCTC GGCCGAGGTG GACATGATCA AACTGCAGCT 3760 
CCACTAGCAT ATGCGCGAGC TTTCTTAGAT GATACAAACA 3840 
AGTTTTCAGG CAATTTGTGG AAGCTTGCGG TCAAGCCTTA 3920 
ATCAGGAAGA AATGAAAGCC AACTACAGGG AAATGGCGAA 4000 
GAGGAGAAGA CGAGCGTCTT ACCGAATTCC CTTCACATCT 4080 
CGGGATGACC AGCTCGTCTT CGGTCGTGTG ATTACATCTC 4160 
CAGGATGCTT TCCAAAGCCA ATCACTGGGG AGACCGAGCA 4240 
AACAACGTTA TTTCTTAACA GACTTTCTAT AGGAGTTGTA 4320 
CAAAGTTTTC ATTGTGTCTT AACAAAGGTG TGGTAGACAC 4400 
GTGTTAGAAT AGATGGCCTA CAGAAAAAAA AGGTTCTQGG 44 80 
GGGGACCTTT TGCCTCGACT CGTGCCGGAA ATCItSATCGT 4560 
TTGTATGACT AGGATTTGTG CTATTATCTC ATTCAACAAC 4640 
TTAATCCGCT ACTGGCTTCA AGTCAGAACT TTGTCATTAA 4720 
CATTTTTAAT ACTCACATGG GCTTATGCAT TAAGTTTAAT 4800 
TAATGGTTTA TTCTTGTCAT AAAAATGTGC AATATGGAGA 4880 

4898 

I 50 1 60 I 70 I 80 



I 10 I 20 I 30 | 40 

1 MEGHVMIAFL PTILHQLFRV LTRATQEEVA VNVTRVTIHV 
81 SMTTILKPSA DFLTSNKLLR YSWFFFDVLI KSMAQHLIEN 
161 SKNANHSIAV FIKRCFTFMD RGFVFKQINN YISCFAPGDP 
241 QLDYSLTDEF CRNHFLVGLL LREVGTALQE FREVRLIAIS 
321 RINVRDVSPF PVNAGMTVKD ESIALPAVNP LVTPQKGSTL 
401 TDSGNSLPER NSEKSNSLDK HQQSSTLGNS WRCDKLDQS 
4 81 EVCIuHQFQYM GKRYIARNGE GLGPIVHDRK SQTLPVSRKR 
561 IATEVCLTAL DTLSLFTLAF KNQLLADHGH NPLMKKVFDV 
641 AALCYEILKC CNSKLSSIRT EASQLLYFLM RNNFDYTGKK 
721 SDRLIKHTSF SSDVKDLTKR I RTVLMATAQ MKEHENDPEM 
801 MCYVHVTALV AEYLrTRKGVF RQGCTAFRV2 TPNIDEEASM 
881 KLIIPIYEKR RDFFEDEDGK EYIYKEPKLT PLSEISQRLL 
961 FFDE KELQER KTEFERSHMI RRFMFEMPFT QTGKRQGGVE 
1041 EMSKKVAELR QLCSSAEVDM 1KLQLKLQGS VSVOVKAGPL 
1121 NERLI KEDQL EYQEEMKANY REMAKE LS EI MHEQICPLEE 
| 10 | 20 | 30 | 40 



| 50 | 60 | 70 | 80 

VAQCHEEGLE SHLRSYVKYA YKAEPYVASE YKTVHEELTK 80 
SKVKLLRNQR FPASYHHAAE TWNMLMPHI TQKFGDNPEA 160 
KTLFEYKFEF LRWCNHEKY IPLNLPMPFG KGRIQRYQDL 240 
VLKKLLIKHS FDDRYASRSH QARIATLYLP LFGLLIENVQ 320 
DNSLHKDLLG AISGIASPYT TSTPN1NSVR KADSRGSLIS 400 
EIKSLLMCFL YILKSMSDDA LFTYWNKAST SELMDFFTIS 480 
TGMMHARLQQ LGS LDNSLTF NHSYGHSDAD VLHQSLL£AN 560 
YLCFLQKHQS ETALKNVFTA LRSLIYKFPS TFYEGRADMC 640 
SFVRTHLQVI ISVSQLIADV VGIGETRFQQ SLSI1NNCAK 720 
LVDLQYSLAK SYASTPELRK TWLDSMARIH VKNGDLSEAA 800 
MEDVGMQDVH FNEDVLMELL EQCADGLWKA ERYELIADIY 880 
KIiYSDKFGSE NVKMIQDSGK VNPKDLDSKY AYIQVTHVIP 960 
EQCKRRTILT AIHCFPYVKK RIPVMYQHHT DLNPIEVAID 1040 
AYARAFLDDT NTKRYPDNKV KLLKEVFRQF VEACGQA1AV 1120 
KTSVLPNSLH IFNAISGTPT STMVHGMTSS SSW 1194 

| 50 | 60 | 70 | 80 




I xo 

1 AATTGTAATA 
81 GAATTCGGCA 
161 ATGAAAAGCA 
241 GTCGTTGAAA 
321 GGTCTCGGCG 
401 GGGTAGATGG 
4 81 ATTTTTTCCA 
561 CATGCGATGG 
641 ACAGGAAGAA 
721 ACTTGAGGTC 
801 ACCAAATCCA 
881 CTTTGATGTA 
961 CTGCATCCTA 
1041 GAGGCATCTA 
1121 CAAGCAGATC 
1201 GTGTAGT GT G 
1281 GACCTCCAGC 
1361 GACAGCCCTC 
1441 ATGACAGATA 
1521 GTCCAGCGGA 
1601 ACCAGCTGTG 
1681 TCTCCGGCAT 
1761 ATAAGCACAG 
16>1 CACATTGGGA 
192a TCTTAAAGAG 
2<&1 ATATCTGAAG 
201 AGTTCATGAT 
2 £61 GCAGCCTGGA 
22*1 GCCAACATTG 
23^1 GGCCGACCAT 
2$$1 CGGCTTTAAA 
2461 ATGTGTGCGG 
25=61 GCTCTACTTC 
2641 CTGTCAGCCA 
2721 GCCAACAGTG 
2801 AATGGCCACC 
i&81 ATGCCAGCAC 
2961 GCAGCAATGT 
H41 CGCCTTCAGG 
^121 ACGAGGATGT 
p201 ATCTACAAAC 
3281 GGAACCCAAA 
SI 61 TCAAAATGAT 
g[TJ41 ATCCCCTTCT 
1T521 TGAGATGCCA 
3601 TACACTGCTT 
3681 ATTGACGAGA 
3761 CAAACTCCAG 
3841 CAAAGCGATA 
3921 GCGGTAAACG 
4001 GGAGCTTTCT 
4081 TCAACGCCAT 
4161 ATGGCCCGTG 
4241 CAGGGAGGAC 
4321 AGAAGGTGCA 
4401 TCTTX5AGCTG 
4481 ATCTACATGG 
4561 AATCAGGGTA 
4641 ATAGAGCAAG 
4721 TCATCGACTC 
4801 TGTGATAAAT 
4881 TGTATACAAG 

I 10 



i 2 ° ! 30 I 40 

CGACTCACTA TAGGGCGAAT TGGGTACCGG 
CGAGTTTTAC ACCATCACCA AAACCCAGAA 
CCACCTGTTG CTCACATTCT TCCATGTCAG 
CCCAAGTTGG CTACTCCTGG CTTCCCCTCC 
AACCTTCCTT CGGGCTATCT TGGCTACCAA 
AGG CAAG CCA CTGCTGAAAA TTTCCACTCA 
GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
AAGGCCACGT GATGATCGCC TTCTTGCCCA 
GTCGCGGTTA ACGTGACTGG GGTCATTATT 
ATATGTTAAG TACGCGTATA AGGCTGAGCC 
TGACCACGAT TCTCAAGCCT TCTGCCGATT 
CTGATCAAAT CTATGGCTCA GCATTTGATA 
TCATCATGCA GCGGAAACCG TTGTAAATAT 
AGAACGCGAA TCATAGCCTT GCTGTCTTCA 
AACAACTACA TTAGCTGTTT TGCTCCTGGA 
CAACCATGAA CATTATATTC CGTTGAACTT 
TTGACTACTC ATTAACAGAT GAGTTCTGCA 
CAGGAGTTCC GGGAGGTCCG TCTGATCGCC 
TGCTTCAAGG AGCCATCAGG CAAGGATAGC 
TCAATGTGAG GGATGTGTCA CCCTTCCCTG 
AATCCGCTGG TGACGCCGCA GAAGGGAAGC 
TGCTTCTCCA TATACAACCT CAACTCCAAA 
ATTCGGGTAA CAGCCTTCCA GAAAGGAATA 
AATTCCGTGG TTCGCTGTGA TAAACTTGAC 
CATGTCTGAT GATGCTTTCT TTACATATTG 
TCTGCCTGCA CCAGTTCCAG TACATGGGGA 
CGA AAGTC TC AGACATTGCC TGTTTCCCGT 
TAACTCTCTC ACTTTTAACC ACAGCTATGG 
CTACTGAGGT TTGCCTGACA GCTCTGGACA 
GGACATAATC CTCTCATGAA AAAAGTTTTT 
AAATGTCTTC ACTGCCTTAA GGTCCTTAAT 
CTCTGTOPTA CGAGATTCTC AAGTGCTGTA 
CTGATGAGGA ACAACTTTGA TTACACTGGA 
GCTGATAGCA GACGTTGTTG GCATTGGGGA 
ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
GCCCAGATGA AGGAGCATGA GAACGACCCA 
GCCCGAGCTC AGGAAGACGT GGCTCGACAG 
GCTATGTCCA CGTAACAGCC CTAGTGGCAG 
GTCATXACCC CAAACATCGA CGAGGAGGCC 
GCTGATGGAG CTCCTTGAGC AGTGCGCAGA 
TTATCATCCC CATTTATGAG AAGCGGAGGG 
CTCACACCGC TGTCGGAAAT TTCTCAGAGA 
ACAGGATTCT GGCAAGGTCA ACCCTAAGGA 
TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
TTTACGCAGA CCGGGAAGAG GCAGGGCGGG 
CCCTTATGTG AAGAAGCGCA TCCCTGTCAT 
TGAGTAAGAA GGTGGCGGAG CTCOGGCAGC 
GGCAGCGTGA GTGTTCAGGT CAATGCTGGC 
TC CTCAC AAT AAAGTGAAGC TGCTTAAGGA 
AACGTCTGAT TAAAGAAGAC CAGCT06AGT 
GAAATCATGC ATGAGCAGAT CTOCCCCCTO 
CAGTGGGACT CCAACAAGCA CAATGGTTCA 
SCTGGGGACT TGCTTTGTCA TTTGCAAACT 
CAA GGGGAAG GGGAGAGAAA GGAAATAAAG 
CATATTTTTT TAAATCTCAC TGGCAATATT 
GACTTAGATT TTATTCTTCC TTGCAGAGTA 
CAGGGAGGGC TGCACTGACA TTGATGCCTG 
CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
AATAGTGAGC TAACTGAGCT AGACACTCAA 
CGGGAOGGTC ATATATGTAT TACATTTCTA 
TTGTOCPGCT CCAGTATATG CAATACACTT 
TCTTTACT 

I 20 I 30 I 40 



I 50 
GCCCCCCCTC 
TTTTATGATG 
CTGTGACAAC 
TGAAAGACGG 
GAGCITGGGA 
TCTGGTTTCT 
AGCCTTAGGA 
CTATCCTAAA 
CATGTGGTTC 
ATATGTTGCC 
TCCTCACCAG 
GAGAACTCCA 
GCTGATGCCA 
TCAAGAGATG 
GACCCAAAGA 
ACCAATGCCA 
GAAACCACTT 
ATCAGTGTGC 
CACCCTCTAC 
TGAACGCGGG 
ACCCTGGACA 
CATCAACAGT 
GTGAGAAGAG 
CAGTCTGAGA 
GAACAAGGCT 
AGCGATACAT 
AACAGAACAG 
CCACTCGGAC 
CGCTTTCTCT 
GATGTCTACC 
TTATAAGTTT 
ACTCCAAGCT 
AAGAAGTCCT 
AACCAGATTC 
CTGATGTGAA 
GAGATGCTGG 
CATGGCCAGG 
AATATCTCAC 
TCCATGATGG 
TGGACTCTGG 
ATTTCTTTGA 
CTCCTTAAAC 
TCTGGATTCT 
CAGAGTTTGA 
GTGGAAGAGC 
GTACCAGCAC 
TGTGCTCCTC 
CCACTAGCAT 
AGTTTTCAGG 
ATCAGGAAGA 
GAGGAGAAGA 
CGGGATGACC 
CAGGATGCTT 
AACAAOGTTA 
CAAAGTTTTC 
GTGTTAGAAT 
GGGGACCTTT 
TTGTATGACT 
TTAATCCGCT 
CATTTTTAAT 
TAATOGTTTA 



I 60 
GAGGTCGACG 
AGATTAAAAT 
TCAAGTAAAG 
AAGGGTGGTG 
TGGGCAGGCA 
ACAGGGATAC 
AACGAACTTG 
CCAGCTGTTC 
CCCAGTGCCA 
TCTGAATACA 
CAACAAACTA 
AAGTTAAGTT 
CACATCACTC 
TTTCACCTTC 
CCCTCTTTGA 
TTTGGAAAAG 
CTTGGTGGGA 
TCAAGAACCT 
CTGCCTCTGT 
CATGACCGTG 
ACAGCCTGCA 
GTGAGAAATG 
CAATTCCCTG 
TTAAGAGCCT 
TCAACATCTG 
AGCCAGGAAC 
GAATGATGCA 
GCAGATGTTC 
ATTTACATTG 
TGTGTTTTCT 
CCCTCAACAT 
GAGCTCCATC 
TTGTCCGGAC 
CAGCAGTCCC 
GGACTTAACC 
TGGACCTCCA 
ATCCATGTCA 
ACGGAAAGGC 
AAGACGTGGG 
AAAGCCGAGC 
AGATGAAGAT 
TGTACTCGGA 
AAGTATGCAT 
GAGATCCCAC 
AGTGCAAACG 
CACACTGACC 
GGCCGAGGTG 
AT6CGCGAGC 
CAATTTGTGG 
AATGAAAGCC 
CGAGCGTCTT 
AGCTCGTCTT 
TCCAAAGCCA 
TTTCTTAACA 
ATTX3TGTCTT 
AGATGGCCTA 
TGCCTCGACT 
AGGATTTGTG 
ACTGGCTTCA 
ACTCACATGG 
TTCTTGTCAT 



I 70 
GTATCGATAA 
AGAGTTGCCC - 
GAAGCACGAA 
ACAAGCGAGC 
TTATGGTCCG 
TCAGGATCAG 
TAAAGTACCT 
CGAGTCCTCA 
TGAGGAAOGA 
AGACAGTGCA 
CTGAGGTACT 
GCTOCGAAAC 
AGAAGTTTGG 
ATGGACAGGG 
ATACAAGTTT 
GCAGGATTCA 
CTGTTACTGA 
GCTGATAAAG 
TTGGTCTGCT 
AAGGAlXaAAT 
CAAGGACCTG 
CTGATTCGAG 
GATAAGCACC 
ACTGATGTGT 
AACTTATGGA 
CAGGAGGGGT 
TGCCAGATTG 
TGCACCAGTC 
GCGTTTAAGA 
TCAAAAACAT 
TCTATGAAGG 
AGGACGGAGG 
ACATTTGCAA 
TGTCCATCA3P 
AAAAGGATAC 
GTACAGCCTG 
AAA A TG G C G A 
GTXaTTTAGAC 
GATGCAGGAT 
GCTACGAGCT 
GGAAAGGACT 
TAAATTTGGT 
ACATCCAGGT 
AACATCCGCC 
GCGCACCATC 
TGAACCCCAT 
GACATGATCA 
TTTCTTAGAT 
AAGCTTCCGG 
AACTACAGGG 
ACCGAATTCC 
CGGTCGTGTG 
ATCACTGGGG 
GACTTTCTAT 
AACAAAGGTG 
CAGAAAAAAA 
CGTGCCGGAA 
CTATTATCTC 
AGTCAGAACT 
GCTTATGCAT 
AAAAATGTGC 



I 



50 



I 



60 



I 



70 



I 80 

GCTTGATATC 80 

ACTCAGCTGC 160 
GAAGAGGGAT 240 
AGCACATCCC 320 
GAAATTAAAT 400 
CATTTACATA 4 80 
TAAGAGTCTG 560 
CCAGAGCCAC 640 
TTGGAGAGCC 720 
TGAAGAACTG 800 
CATGGTTTTT 880 
CAGAGATTTC 960 
AGATAATCCA 1040 
GCTTTGTCTT 1120 
GAATTTCTCC 1200 
AAGATACCAA 1280 
GGGAGGTOGG 1360 
CATTCTTTTG 1440 
GATTGAAAAC 1520 
CCCTGGCTCT 1600 
CTGGGCGCCA 1680 
AGGATCTCTC 1760 
AACAAAGTAG 1840 
TTCCTCTACA 1920 
TTTTTTTACA 2000 
TGGGACCCAT 2080 
CAGCAGCTGG 2160 
ATTACTTGAA 2240 
ACCAGCTCCT 2320 
CAGTCTGAAA 2400 
GAGAGCGGAC 24 80 
CCTCCCAGCT 2560 
GTCATCATAT 2640 
CAACAACTGT 2720 
GCACGGTGCT 2800 
GCCAAATCCT 2880 
TCTCTCAGAG 2960 
AAGGATGCAC 3040 
GTCCATTTCA 3120 
CATCGCCGAC 3200 
ATATTTACAA 3280 
TCTGAAAATG 3360 
GACTCACGTC 3440 
GCTTCATGTT 3520 
CTOACAGCCA 3600 
OGAGGTQGCC 3680 
AACTX3CAGCT 3760 
GATACAAACA 3840 
TCAAGCCTTA 3920 
AAATGGCGAA 4000 
CTTCACATCT 4080 
ATTACATCTC 4160 
AGACCGAGCA 4240 
AGGAGTTGTA 4320 
TSGTAGACAC 4400 
AGGTTCTGGG 4480 
ATCTGATCGT 4560 
ATTCAACAAC 4640 
TTG TCATT AA 4720 
TAAGTTTAAT 4800 
AATATGGAGA 4880 
4898 



I 



80 



I 10 I 20 | 30 | 40 I 50 | 60 | 70 | 80 

MEGHVMIAFL pTI LNQLFRV LTRATQEEVA VNVTRVIIHV VAQCHEEGLE SHLRSYVKYA YKAEFYVASE YKTVHEELTK 80 

Rl SMTTILKPSA DFLTSNKLLR YSWFFFDVLI KSMAQHLIEN SKVKLLRNQR FPASYHHAAE TWNMLMPHI TQKFGDNPEA 160 

161 SKNANHS1AV FIKRCFTFMD RGFVFKQ INN YISCFAPGDP KTLFEYKFEF LRWCNHEHY IPLNLPMPFG KGRIQRYQDL. 240 

241 OLDYSLTDEF CRNHFLVGLL LREVGTALQE FREVRLIAIS VLKNLLIKHS FDDRYASRSH QARXATLYIiP LFGLLIENVQ 320 

121 RINVRDVSPF FVNAGMTVXD ESLALPAVNP LVTPQKGSTL DNSLHKDLLG AISGIASPYT TSTPNINSVR NADSRGSLIS 400 

4D1 TDSGNSLPER NSEKSNSLDK HQQSSTLGNS WRCDKLDQS EIKSLLMCFL YILKSMSDDA LFTYWNKAST SELMDFFTIS 4 80 

dBl EVCLHOFQYM GKRYIARNQE GLGPIVHDRK SQTLFVSRNR TGMMHARLQQ LGSLDNSLTF NHSYGHSDAD VLHQSIiEAN 560 

"J TATEVCLTAli DTLSLFTLAF KNQLLADHGH NPLMKKVFDV YLCFLQKHQS ETALKNVFTA UlSLIYXFPS TFYEGRADMC 64 0 

641 AALCYEILKC CNSKLSSIRT EASQLLYFLM RNNFDYTGKK SFVRTHLQVI ISVSQLIADV VGIGETRFQQ SLSIINNCAN 720 

721 SDRIf I KHTSF SSDVKDLTKR IRTVXMATAQ MKEHENDPEM LVDLQYSLAX SYASTPELRK TWLDSMARIH VKNGDLSEAA 800 

SOI MCYVHVTALV AEYLTRXGVF RQGCTAFRVI TPNIDEEASM MEBVGMQDVH FNEDVLMELL EQCADGLWKA ERYELIADIY 880 

fifil KLI I PI YEKR RDFFEDEDGK EYIYXEPKLT PLSEISQRLL KLYSDKFGSE NVKMIQDSGK VNPKDLDSKY AYIQVTHVTP 960 

Q61 FFDEKELQER KTEFERSHNI RRFMFEMPFT QTGKRQGGVE EQCKRRTILT AIHCFPYVKK RIPVMYQHHT DLNPIEVAID 1040 

inai EMSKKVAELR QbCSSAEVDM IKLQLKLQGS VSVQVNAGPL AYARAFLDDT NTKRYPDNKV KLLKEVFRQF VEACGQAIAV 1120 

mtttjitkeDOL EYQEEMKANY REMAKE USE I MHEQICPLEE KTSVLPNSLH IFNAISGTPT STMVHGMTSS SSW 1194 
1121 NERLIKEW^ | 20 | 30 | 40 | 50 | 60 | 70 | 80 





1 

1. 
;i 
11 

21 
01 
61 
61 
41 
21 
101 
181 
?61 



281 
361 
.441 

.521 
L601 

Lfiai 



1921 
2001 
2081 
2161 



I 10 
AATTGTAATA 
GAATTCGGCA 
ATGAAAAGCA 
GTCGTTGAAA 
GGTCTCGGCG 
GGGTAGATGG 
ATTTTTTCCA 
CATGCGATGG 
ACAGGAAGAA 
ACTTGAGGTC 
ACCAAATCCA 
CTTTGATGTA 
CTGCATCCTA 
541 GAGGCATCTA 
121 CAAGCAGATC 
201 GTGTAGTGTG 
GACCTCCAGC 
GACAGCCCTC 
ATGACAGATA 
GTCCAGCGGA 
ACCAGCTGTG 
TCTCCGGCAT 
L761 ATAAGCACAG 
1841 CACATTGGGA 
TCTTAAAGAG 
A^ATCTGAAG 

AprrrcATGAT 

G&AGCCTOGA 
2241 GCCAACATTG 
2321 GGCCGACCAT 
2401 ©SCSCTTTAAA 
2481 ATCTC5TGCGG 
2561 i^CTCTACrrC 
2641 rC J l Vl' C AGCCA 
2721 ^GCCAACAGTG 
2B01~AATGGCCACC 
2881 ATGCCAGCAC 

2 961» GCAGCAATGT 

3 04 3MDGCCTTCAGG 
3 12 l^ACGAGGATGT 
3 2 OINaTCTACAAAC 
3281 GGAACCCAAA 
3 3 61 , TCAAAATGAT 

3 44}^ATCCccrrcr 

3 52^ : TGAGATGCCA 
3 €<m TACACTGCTT 
3681 ATTGACGAGA 
3761 CAAACTCCAG 
3841 CAAAGCGATA 
3921 GCGGTAAACG 
4001 GGAGCTTTCT 
4081 TCAAOGCCAT 
4161 ATGGCCCGTG 
4241 CAGGGAGGAC 
4321 AGAAGGTGCA 
4401 TCTTGAGCTG 
4461 ATCTACATGG 
4561 AATCAGGGTA 
4641 ATAGAGCAAG 
4721 TCATCGACTC 
4801 TGTGATAAAT 
4881 TSTATACAAG 



I 20 I 30 ! 40 

CG ACTCAC TA TAGGGCGAAT TGGGTACCOG 
CXSAG im'AC ACCATCACCA AAACCCAGAA 
CCACCTGTTG CTCACATTCT TCCATGTCAG 
CCCAAGTTCG CTACTCCTO5 CTTCCCCTCC 
AACCTTCCTT CGGGCTATCT 7X3GCXACCAA 
AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
AAGGCCACGT GATGATCGCC TTCTTGCCCA 
GTCGCGGTTA ACGTOACTCG GGTCATTATT 
ATATGTTAAG TACGCGTATA AGGCTGAGCC 
TGACCACGAT TCTCAAGCCT TCTGCCGATT 
CTGATCAAAT • CTATOGCTCA GCATTTGATA 
TCATCATGCA GCGGAAACCG TTGTAAATAT 
AGAACGCGAA TCATAOCCTT GCTCTCTTCA 
AACAACTACA TTAGCTCTTT TGCTCCTGGA 
CAACCATGAA CATTATATTC CGTTGAACTT 
TTGACTACTC ATTAACAGAT GAGTTCTGCA 
CAGGAGTTCC GGGAGGTCOG TCIGATCGCC 
AGCCATCAGG CAAGGATAGC 
TCAATGTGAG GGATGTGTCA CCCI'IVUCW 
AATCCGCTGG TGACGCCGCA GAAGGGAAGC 
TGCTTCTCCA TATACAACCT CAACTCCAAA 
ATTCGGGTAA CAGCCTTCCA GAAAGGAATA 
AATTCCGTGG TTCGCTGTGA TAAACTTGAC 
CATGTCTGAT GATT3CTTRJT TTACATATTG 
TCTCCCTGCA CCAGTTCCAG TACATGGGGA 
CGAAAGTCTC AGACATTOCC Tmi iLXXXJT 

^ctctctc AcrrrrAAcc acagctatog 

CTACTGAQGT TTT5CCTGACA GCTCTGGACA 
GG ACATAAT C CTCTCATGAA AAAAGTTTTT 
^JGTOTC ACIX3CC3TAA GGTCCTTAAT 
CTCmXiXTA CGAGATTCTC AAGTGCTGTA 
CTGATCAGGA ACAACTTTGA TTACACTGGA 
GCTGATAGCA GAOGTTGTTG GCATTGGGGA 
ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
GCCCAGATCA AGGAGCATGA GAACGACCCA 
GCCCGAGCTC AGGAAGACGT GGCTCGACAG 
GCTATGTCCA CGTAACAGCC CIAGTGGCAG 
GTCATTACCC CAAACATCGA CGAGGAQGCC 
GCTGATGGAG CTCCTTGAGC AGTGCGCAGA 
TTATCATCCC CAT3TATOAG AAGCGGAGGG 
CTCACACCGC TGTCGGAAAT TTCTCAGAGA 
ACAQGATTCT GGCAAGGTCA ACCCTAAGGA 
TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
TITACX SCAQA CCGGGAAGAG GCAGGGCGGG 
CCCTTATGTG AAGAAGCGCA TCCCTGTCAT 
TOAGTAAGAA GGTGGCGGAG CTCCGGCAGC 
GGCAGCGTGA GTGTTCAGGT CAATGCTGGC 
TCCTX3ACAAT AAAGTCAAGC TGCTTAAGGA 
AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
GAAATCATGC ATGAGCAGAT CTOCCCCCTG 
CAGTSGGACT CCAACAAGCA CAATOGTTCA 
TOTBQOBACT TGCTTTGTCA TXTOCAAACT 
CAAGGGGAAG GGGAGAGAAA GGAAATAAAG 
CATATTTTTT TAAATCTCAC TGGCAATATT 
GACTIAGATT TTATTCTTCC 7TCCAGAGTA 
C»GGGAGGGC TGCACTGACA TTGATGCCTC 
CAGAACTXAC TAGTTTTGTC TAGGAGTATG 
AATAGTGAGC TAACTGAGCT AGACACTCAA 
CGGGA OQGTC ATATA1X3TAT TACATTTCTA 
TXHBC TQGT CCAGTATATG CAATACACTT 
TCTTTACT 

' 20 I 30 | 40 



| 50 
6CCCCCCCTC 
TTTTATGATG 
CPGTGACAAC 
TGAAAGACGG 
GAGCTTOGGA 
TCTCGTTTCT 
AGCCTTAGGA 
CTATCCTAAA 
CATGTGGTTG 
ATATGTTGCC 
TCCTCACCAG 
GAGAACTCCA 
GCTGATGCCA 
TCAAGAGATG 
GACCCAAAGA 
ACCAATGCCA 
GAAACCACTT 
ATCAGTGTGC 
CACCCTCTAC 
TOAACGOGGG 
ACCCTGGACA 
CATCAACAGT 
GTGAGAAGAG 
CAGTCTGAGA 
GAACAAGGCT 
AGCGATACAT 
AACAGAACAG 
CCACTCX3GAC 
CGCTTTCTCT 
GATGTCTACC 
TTATAAGTTT 
ACTCCAAGCT 
AAGAAGTCCT 
AACCAGATTC 
CTGATGTGAA 
GAGATGCTGG 
CATGGCCAGG 
AATATCTCAC 
TCCATGATGG 
TOGACTCTGG 
ATTTCTTTGA 
CTCCTTAAAC 
TCTGGATTCT 
CAjGA GTTTOA 
GTCGAAGAGC 
GTACCAGCAC 
TGTGCTCCTC 
CCACTAGCAT 
AGTTTTCAGG 
ATCAGGAAGA 
GAGGAGAAGA 
CGGGATGACC 
CAGGATGCTT 
AACAACGTTA 
CAAAGTTTTC 
GTGTXAGAAT 
GGGGACCTIT 
TTGTATGACT 
TTAATCCGCT 
CATTTTTAAT 
TAATGGTTrA 

| 50 



| 60 
GAGGTCGACG 
AGATTAAAAT 
TCAAGTAAAG 
AAGGGTGGTG 
TGGGCAGGCA 
ACAGGGATAC 
AACGAACTTG 
CCAGCTG7TC 
CCCAGTGCCA 
TCTGAATACA 
CAACAAACTA 
AAGTTAAGTT 

cacatcactc 
tttcaccttc 
ccctCtttga 
tttggaaaag 
cttggtggga 
tcaagaacct 

CTGCCTCTGT 
CATGACCGTG 
ACAGCCTGCA 
GTGAGAAATG 
CAATTCCCTG 
TTAAGAGCCT 
TCAACATCTG 
AGCCAGGAAC 
GAATGATGCA 
GCAGATGTTC 
ATTTACATTG 

CCCTCAACAT 
GAGCTCCATC 
TTGTCCGGAC 
CAGCAGTCCC 
GGACTTAACC 
TGGACCTCCA 
ATCCATGTCA 
ACGGAAAGGC 
AAGACGTGGG 
AAAGCCGAGC 
AGATGAAGAT 
TGTACTCGGA 
AAGTATGCAT 
GAGATCCCAC 
AGTGCAAACG 
CACACTGACC 
GGCCGAGGT6 
ATGCGCGAGC 
CAATITGTGG 
AATGAAAGCC 
CGAGCGTCTT 
AGCTCGTCTT 
TCCAAAGCCA 
T1TCTTAACA 
ATTGTGTCTT 
AGATGGCCTA 
TGCCTCGACT 
AGGATTTGTG 
ACTGGCTTCA 
ACTCACATGG 
TTCTTGTCAT 

I eo 
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GTATCGATAA 
AGAGTTGCCC 
GAAGCACGAA 
ACAAGCGAGC 
TTATGGTCC6 
TCAGGATCAG 
TAAAGTACCT 
CGACTOCTCA 
TGAGGAAGGA 
AGACAGTGCA 
CTGAGGTACT 
GCTGCGAAAC 
AGAAGTTTGG 
ATGGACAGGG 
ATACAAGTTT 
GCAGGATTCA 
CTC5TTACTGA 
GCTGATAAAG 
TTOGTCTOCT 
AAGGATGAAT 



t 80 

©CTTGATATC 



GAAGAGGGAT 
AGCACATCCC 
GAAATTAAAT 
CATTTACATA 
TAAGAGTCTG 
CCAGAGCCAC 



TGAAGAACTG 
CATGGTITTT 
CMAGATTTC 
AGATAATCCA 
GCTTTGTCTT 
GAATTTCTCC 
AAGATACCAA 



CATTCTTTTG 
GATTGAAAAC 
CCCTGX3CTCT 



CTCAU atjuAG AGGATCTCTC 
GATAAGCACC AACAAAGTAG 
ACTGATGTGT TTCCTCTACA 
AACTTATGGA TTT3TTTACA 
^*®**®®®9T TGGGACCCAT 
T^GCCAGATTG CAGCAGCTGG 



ACCAOCTCCT 

TCAAAAACAT 
TCTA1X3AAGG 



a CCTCCCAGCT 

ACATTTGCAA GTCATCATAT 
1X5TO CATCKP CAACAACTGT 
AAAAGGATAC GCACGGTOCT 



AAAATGG CGA TCTCTCAGAG 
GTGTTTAGAC AAGGATQCAC 
GATGCAGGAT GTCCATTTCA 



GACATGA TCA AACTGCAOCT 
T i'lVlTAGAT GATACAAACA 



ACCGAATTCC CTTCACATCT 
CGGTOGTGTG ATTACATCTC 



OAcrrrcror aggagttgta 



WA MWA AGGTTCTOGG 
OGTCCOG QAA ATCTGATCGfT 
CTATTATCTC ATTCAACAAC 



GCTTATGCAT TAAGTTTAAT 
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eo 

160 

240 

320 

400 

460 

SCO 

640 

720 

800 

880 

960 

1040 

1120 

1200 

1280 

13 60 

1440 

1S20 

1600 

1680 

1760 

184a 

1920 
2000 
2080 
2160 
2240 
2320 
2400 
2460 
2560. 
2640 
2720 
2800 
2880 
2960 
3040 
3120 
3200 
3280 
3360 
3440 
3520 
3600 
3680 
3760 
3640 
3920 
4000 
4080 
4160 
4240 
4320 
4400 
4480 
4560 
4640 
4720 
4800 
4880 
4898 
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■ \ AO I 50 ' Jt~ vTr^RPYVASfi YTCTVHEELTK BO 

» ^ »^ 'sis ^ss s ^^^^^ss"« 

KEGHVMI^^ SSSI* XSWTFFDVL! *f»£^£ IS^WXF LRWCNKEHV SSjSeHVQ 320 

SMTTIU^SA ^^OTTtS RGFVFKQINN ^ff**^* $uS£l«HS TOMWASSSH C^^j^ HRDSRSSWS 400 
SKHMWS1AV *^£raUL WEVGTW^E FMWULMS AJSG1ASWT EEUKDFFTIS 4 80 

QUtfSLTDEF ^^Jj^KD E S1A1J?AVNP ^VTPQKGSTL YIWSMSDDA vMQSIiEM* 560 

RISV^SPF HOQSSTl^S VVK^^ xCWraWUAQ "SSLDH^TT SrMHC 640 

l TDSGNSLPER ^^^otb GliGPIVHDRX ^^^g etMJWVFTA ^£-I" ?qq ei ^hkNCAW 720 

1 EVCUIQFQVM GKKYI«^ ^qUadHGH I^^THWTrt ISVSQWAW VFWSOLSEMt 800 

1 j^QIAYTLM HSKTOTOT "^SsLMC SYASTFSU* TWUDSW^ ^ yEIjlMr y 88 0 

1 UCVE"*C 1RTV1J ^TAQ OT^EK FNEDVTXK^' S^WIP «°„ 

1 SDKLIKW TSF S !SSi K eVF RQGCTAFKVI TPNIDEEASW ^gj. svjMJQDSGK ^^toKHT DIOTIBVAU) 3.04O 

a mcyvhvt^ ww*eww veacoomav a«o 

1 KWlPrifBKR S^^^j MtFMFEMFFT OTGKRQGGVE ^AJ^FlJjnr KTKRYPBHIW "f^^jS- SSV V X194 
1 SSS^S SS XFIoaSGrm SWVHSKTSS SSW . 80 

j X 0 I - . 
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41 

21 
01 

.81 



1361 
1441 
1S21 
1601 
1681 
1761 
1841 



1 AATTCTAATA CGACTCACTA TAGGGCGAAT TGajTJuvw L- S0 I . 60 | 70 I fin 

81. GAATTCGGCA CGAGTTTTAC ACCATCA«I AaSSS SfHS" 0 GAGCTCGACG GTOTOSATAA GCTTGATATC 80 

61 ATGAAAAGCA CCACCTGTTG CTCACATTCT £2£cS £3^1^ ^™«*T AGAGTTOCCC -ACT^S^ «D 

4' GTCGTTGAAA CCCAAGTTGG CTACTcSS CTTcSS^ Sf!?^** 0 TCAA£JTAAAG GAAGCACGAA GAASAOGsS =!n 

GGTCTCGGCG AACCTTCCTT OGGGCTATCT S£Sa J^5*f™* **«»rGGTG ACAAGCGAGC AGCACATC^ 3=n 

PGGTAGAT BG AGGCAAOCCA CTCCTCAAaI ^^^SS^ g^"^ TGGGCAGGCA TTATOGTCOS gW?I££ «0O 

... ArrrrrrccA gtactotcag aaaacogaat ISS^ rrcr acwk «atac tcagqatcag cmtScatI 

i6i CATGCGATGG AAGGCCACGT GATGATCGCC S^oS ^SZ^ AACGAAC1TG TAAAGTACCT T^AGAGTCTG Iso 

541 ACAGGAAGAA GTCGCGSTTA ACGTOACTOG MhSSSS ? ATCCTW CCAGCTGTTC CGAGTCCTCA CCAGAacSS 

721 ACTTGAGGTC ATATCTTAAG TACGOCTA^ A^gSgS ^^2^ CCCAGT «^A TGAGGAAGGA TTGGAGAGCC 7=n 

B01 ACCAAATCCA TGACCACGAT TCToScS '"^^ AGACAGTOCA TCAAGAACTG GOO 

881 CTTTGATGTA CTGATCAAAT CTATQGCTCA GCATTTGMTl CMkCAWkCTA CTGAGGTACT oSgottt?? 8SO 

961 CIGCATCCWl TCATCATOCA SCGGAAACWS TralISS Sf^*" 00 * AW3TTAM?TT GCIGOGAAAC CAGAGATr££ 

041 GAGGCATCTA AGAACGCGAA TCM1«CCT? GctS^I S^ TCC ° k AGAAGTTTGG AGATAATCCA 

.121 CAAGCAGATC • AACAACTACA TTAGCTGTTT £S££££a" S^f™ 3 TTOACCTrC ATGGACAGGG GCTTTGTCTT l^n 

.201 GTGTAGTGTG CAACCATGAA CATTATA1TC CGraSSS SS^**** CCCTCTTWSA ATACAAGTTT GAMTTCtS ™ 

.281 GACCTCCAGC TTGACTACTC TTXGGAAAAG GCAGGATTCA AAGaSS^ 13«n 

GACAGCCCTC CAGGAGTTCC GGGAGGTGCG ^^SS CTTGGTGGGA CTCTTACXtSA GGGAGGTGQG llll 

ATGACAGATA TGCTTCAAGG AGCCATrarar r»«^«^ZI ATCAGTGT6C TCAAGAACCT GCTGATAAAG gJnr rTT . 360 

CTCCAGCGGA TCAATCtSaI SSgSS S*^CTCTAC CTGCCTCTOT TTOGtSSc? S^AAAaI «2£ 

ACCAGCTGTG AATCCGCTGG TGA0GC0GCA SSS^S I^*f52° QG "^CCCTG AAGGATCAAT CCCTOOCT^ 

TCTCCGGCAT TOCircreCA TATACaS SS^SS acagcctgc * CAAGGACCW cSSS^SS l««n 

ATAAGCACAG A7 TCOGOTA A CAQCCTTOCA SSS^ ST^^ ^""^ AGGATC^CTC 17*n 

— - CACATTGGGA AATTCCGTGG TTCSCTBTGA TAAACTKsS SSS^ 5 " 3 ^*^ TCCCTG GATAAGCACC AACAAAgSc ^Zf° 

1921 TCTTAAAGAG CATGTCTGAT GATCCTTTOT S^ilSf "GTCTGAOA TTAAGAGCCT ACTGATGTGT TTeerSSSi? " 4 ° 

3001 ATATCTC3AAG TCTGCCTBCA ££££££ TCAACATCTC AACTTA^gS SS^S 

2081 gTTCATGAT CGAAAGTCTC AGACATTCCC TGTTO^ i^Tt^I AGCCAGGAAC CABGAOGGGT ^GGa££££ llln 

2161 GCAGCCTGGA TAACTCTCTC ACTZTTAACC IS^i^ AACAGAACAG GAATSATGCA TGCCAGATTG CAQCin^^ 

2241 ^CAACATTC CTACTOAGGT TTCCCTGACA SSSS ^ ACT ° GaMC «CAGATGTTC TG<S SS^SS ZllZ 

2321 I^GCCGACCAT GGACATAATC CTCTCATGAA iSSSS^ CGCTTTCTCT ATTTACATTG GOGTTTAAGA j£2S25i£ " 4 ° 

2401 jOSGCTTT AAA AAATBTCTTC Ac£S£*t ^^CT^CC -IGI KH ' I ' VIW ?SaIaA^. SgSS *"? 

2481 WOTOTGCOS CTCTOTOTtA CGAoS^ ISSJSSZ CCCTCAACAT ^CATGAAQG GAG^gSS 

2561 BCTCTACTrC CTOATGAGGA ACAACTTTOA TtK£S£ ^SSS! OAOCJOC ATC AGGAOGGAQG CCTC^CAOCT Itll 

264imroTCAGCCA GCTGATAGCA GACGTTCTTG gS^SSS tJSSSISE TrGTCCGGA C ACATTTOCAA CTCATCA^S =«SS 

2721= GCCAACAGTG ACOQGCITAT TAAQCACACC AqSS22S AA 5° >GArrC CAGCAGTCCC TGTCCATCAT CAAcS^r^ 

2801^VATGGCCACC GCCCAGAT5A AGQAGWOA SaS^ 5 TOATOTGA » GGAC1TAACC AAAAGG^AC G^SSSS ¥*° 

2881^TGCCAGCAC GCCCGAGCTC ATCAAGaSS G^SSSS GAGATOCTOG TOGACCTCCA GtoSgotS SSS^ 

2961 GCAGCAATOT GCTATOTCCA OCTAACaS ^SS^ CATG3CC»OG ATCCATGTCA AAAATOGCGA 

304f OGCCrrCAGG GTCATTACCC OAA^S jJATATCTCAC ACGGAAAOGC GTOnTAGAC ISSgSS 

312 J - ACGAGGATGT GCTOATOGAG CTCCTTCAOC TCCATGATOQ AAGACGTGGG GATOCAGGAT GTcSSSS I ° 

3 2 0r ATCTACAAAC TTATCATCCC S^S^ A^SSS T??*^^ AAAGCCGAGC 0^1^ 3=On 

328X GGAACCCAAA CTCACACCOC TOTCGGAAAT TT^rnSS? AGATGAAGAT OGAAAGGAGT M^T; "°° 

3 3 61*TCAAAATGAT ACAGGAl^T G3»AGG^ SSSSS Z******** ^CTOSGA tSaIIStogT 7^HSS 

344§|ATCCCCrPCr TTGACGAAAA AOAOTG^ G^SS^ I CTQGATrCr AAOTATGCAT ACAtcSooT GA^S^S " 6 ° 

3 5211 TGAGATGCCA TTTACGCAGA COQGGAAGAG G^A^GQCGOG GTOOtir^««^ OASATCCCAC AAC^T^^GCC GCTTC^rGTT 352D 

3601 TACACTGCIT CCCTTATGTG AAGAAGWtt TOCCimSS S^SJf 0 A^SCM^CG GCGCACCATC CTOACAOcS ««« 

36 W ATrGACGAGA TGAGIAAGAA QGTGGC^Q C^GG^ ^^AC CACACTGACC TtlAACCCCAT SaGOT^S 

37tl CMACKOS OGCM>C6K» OTOtS JSSS 'Z^?* 6 ™ AACTgSI^ «c2 

3841 CAAAGCGATA TCCTGACAAT AAAGTOAAGC TOmSI^ «=^»^ T ATGCOOGAOC TTTCTTAGAT GATACAAaS «!S 

3921 GCGGTAAAOG AACGTCTGAT TAAAGAAGAC CAgSS^ ^S^T* 00 ^"TCTOQ AAGCTTGCOG TOMWBC^ a«S 

4001 GGAGcrncr gaaatcatoc atgaocasat SSSSSSI *tcaggaaga aatoaaagcc aacxacagog aaat^AT^ 3 ? 20 

4081 ^AAC^CCAT CAGTGGGACT Sa^^A C ^^ZZ ^ 

4161 atggccogtg tbtooggact tcottgtS tog^HS SSS atgm:c AOCTOGicrr oegTcoi m t? atiSamS.!?!? 

4541 cagggaggac caagoggaag gogagagaaa Saa^SS ^£! SC<A KK ^ I ^ ^^S.\l^ 

4321 AGAAGCTGCA CATATTTTTT TAAATCTCAC TOcS ^^S^ T^^Efl QACTTTCTAT AGGAOTTG^A 4330 

4401 TCTTGAGCTG GACTTAGATT TTATTCTTCC TTOCaSI ATTGTGTCTT AACAAAGGTC TOGTAGAoS All 

4481 ATCTACATGG CAGGCAGQGC TGCACTGaS SbSSS^ SST rRGMCT ASA '««CTA CAGAAAAAAA AOGITCrSe 

4561 AATCM3GGTA CAGAACTTAC iSgtS^GTC 7AGGAQTM« WCCTCGACT OGTGCOGwS «T^4SS? 

4641 ATAGAGCAAG AATAGTGAGC TAaSS A^S^ SJ^SS ^T TCTG ATtSaCA^ «!n 

4721 TCATCGACTC CGGGACGGTC ATATA7X3TAT Stt^ IT^I^SZ *™GCTTCA AGTCAGAACT TIVKXrrt* IwS 

4801 TOTGATAAAT TTCTOCTGGT CCAGTATATC SS? gfT^T^! ^f?"*™ 6 GCTTATB CAT TAAOTTW «O0 



4881 



TQFTATACAAG TCTTTACT ^ATACACTT TAATGGTTTA TTCTTCTCAT AAAAATGTGC 

I 80 
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\ » . i» ^UJi v»cL^ s^sS SSSSS SSSS 
S5SS 535KS SSSSS ££££ 35S g-g-j SSSSR S 

"ssss: ssss »»*^ s "? 

I sss; ssss ss=£ sags ssss sssss; 

1 SS SSSS£| 3--K SS SSS ^—SSS K 

1 ^LlKHTSF SSPVKDLTKK TWOTRSM MEDVGMODVH S£SuJSiar KVXQVTHVIF 960 

; ss S S s^s ™ S ssss-^se ^ 
gS^.s a rS=r5 ^ -r-s -r* ""^ 
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61 
II 
21 
01 
81 
61 
41 
21 
.01 

;bi 

141 
?21 
301 
381 
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241 
321 
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481 
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i 10 
AATTGTAATA 
GAATTCGGCA 
ATGAAAAGCA 

GGTCTCGGCG 
GGGTAGATGG 
ATTTTTTCCA 
CATGCGATGG 
ACAGGAAGAA 
ACTTGAC5GTC 
ACCAAATCCA 
CTTTGATGTA 
CTGCATCCi'A 
GAGGCATCTA 
CAAGCAGATC 
GWA ' A G T GT U 
GACCTCCAOC 



| 20 
CGACTCACTA 
CGAGTTTTAC 
CCACCTC3TTG 
CCGAAGTTGG 
AACCTTCCTT 
AGGCAAGCCA 
GTACTGTCAG 



GTCGCGCTTA 
ATATGTTAAG 
TGACCACQAT 
CTGATCAAAT 



I 30 
TAGGGCGAAT 
ACCATCACCA 
CTCACATTCT 
CTACTCCTQG 
CGGGCXATCT 
C7GCTGAAAA 
AAAACC6AAT 
GATCATCGCC 
ACGTCACTCG 
TACGCGTATA 




40 

TSGSTACCGG 
AAACCCAGAA 
TCCATGTCAG 
CTTCCCCTCC 
TGGCTACCAA 

rrircAcrcA 




I 50 I 60 I _ 

CCCCC CCCTC GAGGTCGACG GTATCGATAA 
TTTTATGATG AGATTAAAAT AGAGTTGCCC 
CTGJUACAAC TCAAGTAAAG GAAOCACGAA 
TCAAAGACGG AAGGGTGGTG ACAAGCGAGC 
GAGCTTOGG A TGGGCAGGCA TTATGGTCCG 
TCTGGmvr ACAGGGATAC TCAGGATCAG 



I ao 



TTCTTGCCCA 
GGTCATTATT 



CTATGGCTCA 



AGAACGCGAA 
AACAACTAGA 
CAACCATGAA 



TCATAGCCTT 
TTAGCTGTTT 
CATTATATTC 
ATTAACAGAT 



TCTCCOGATT 
GCATTTGATA 
TTGTAAATAT 
GCTGTCTTCA 



CTATCCTAAA 

CATGI GUATbt 
ATATGTTBCC 



GAAATTAAAT 
CATTTACATA 

TAAGAGTCTG 

CCAGCTGTTC CGAGTOCTCA CCAGAGCCAC 



TCTGAATACA AGACAGTGCA 
CAACAAACTA CTGAGGTACT 



GAGAACTCCA AAGTTAAGTT GCTGCGAAAC' CAGAGATTTC 



CGTTGAACTT ACCAATCCCA 



ACCAGCTGTG 



ATGACAGATA TGCTTCAAGG AGCCATCAGG 
TCAA7GTGAG GGATGTCTCA 
AATCCGCTOG TGACGCGGGA 

tccttctcca tatacaacct 
attcgggt a a cagccttcca 

AATTCCGTGG TTCGCTGTGA 
CATGTCTCAT GATGCTiUUT 
TCTGCCTGCA CCAGTTCCAG 
CGAAAGTCTC AGACATTGCC 



CAAGGATAGC 

cccrrcccTG 



ATAAGCACAG 
CACATTGGGA 



AGTTCATGAT 



T7ACATA7TG 
TACATO QQGA 

TOrrrcccGT 



GCASCCTGGA TAACTCTCTC ACTTTTAACC ACAGCXATGG 
CCCSACATTC CTACTGAGGT TTGCCTOACA GCTCXGGACA 
k - GGACATAATC CTCTCATGAA AAAAGTTTTT 

AAATGTCAT L! ACTGCCTTAA GGTCCTIAAT 
CTCTOXGTTA CGAGATTCTC AAGTGCTGTA 



CGGgTTTAAA 
ATGTGTGCGG 



CTGTfcAGCCA GCTGATAGCA GACGTTGTTG GCATTGGGGA 



A AgQg CCACC GCCCAGATGA AGGAGCATGA GAACGACCCA 
ATCGCAGCAC GCCCGAGCTC AGGAAGACGT GGCTCGACAG 

GCAGCAATGT gctatgtcca cgtaacagcc ctagtggcag 



ACGAG GATBT GCTGATSGAG CTCCTTGAGC AGTGCGCAGA 

^S^S^-^SSSS^ CMTXATOAG AAGCGGAGGG 
GGAACCCAAA CTCACACCGC TGTCGGAAAT TICTCAGAGA 
TCAAAATCA T ACAQ GATTCT GGCAAGGTCA ACCCTAAGGA 
ATCCCCXTCT TTGA CGAAAA AGAGT7GCAA GAAAGGAAAA 
TCjA GATGCCA TTTAOGCAGA CCGGGAAGAG GCAGGGOGGG 
TAGACTGC1T CCCTTATGTG AAGAAGCGCA TCCCTGTCAT 
ATgj SACOAG A TGAGIAAGAA GGTGGCGGAG CTCCGGCAGC 
CAAACTCCAG GQCAGOBTGA GTCTTCAGGT CAATGCTGGC 
CAAAGCGATA TCCTGACAAT AAAGTCAAGC TCCTXAAGGA 
GCGGTAAAOG AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
GGAGCITTCX GAAATCATGC ATGAGCAGAT CTGCCCCCTG 
TCAAOGCOT CAGTGGGA CT CCAAGAAGGA CAATGGTTCA 
ATGGCCCGTG TGTGGGGACT- TGCTTTGTCA TTTGCAAACT 
CAGGGAGGAC CAA GGGSAAG GGGAGAGAAA GGAAATAAAG 
AGAAGGTOCA CA 7A1 1 A'A'AT TAAATCTCAC TGGCAATATT 
TCrTGAGCTG GACTTAGATT TT JflTL 'AT C C TTGCAGAGTA 



AATCAGGGTA CAGAACTTAC TAGTT3TCTC XAGGAGTATtS 
ATAGAGCAAG AATAGTGAGC TAACTGAGCT AGACACTCAA 
TCATCGACTC 0GGGACGG7C ATATATGTAT TACATTTCTA 
TG1UATAAAT 1AUAUCAWA' CCAGTATATG CAATACACTT 



I 



10 



I 



20 



I 



30 



I 40 



ATCAGTGTGC 
CACCCTCTAC 



CACATCACTC AGAAGTTTGG 
■ llUVALCi ' AV ATGQACAGGG 
CCCTC3TZX3A ATACAAGJTZT 
TTTGGAAAAG GCAGGATTCA 
CTTGGTGGGA CTGTTACTGA 



GcrrrcTcrr 



CTGCCTCTGT TTGGTCTGCT 
CATGACCGTG AAGGATGAAT 



i2S5E?2 c^^ccro ciuoo^S 



CTGAGAAGAG CAA3TCCCTG GATAAGCACC 
TTAAGAGCCT ACTGATGTGT 



AGC6ATACAT AGCCAGGAAC CAGGAOGOGT 
AACAGAACAG GAATGATGCA TGCCAOATTO 
CCACTOGGAC GCAGATGTTC TGCACCAGTC 
C GCA A'lV AVf ATTTACATTG GCGTTTAAGA 
GATGTC TACC T G J U i ' A ' i ' AVA ' TCAAAAACAT 
TTATAAGTTT CCCTCAACAT TCTATGAAGG 



TTCCTCXACA 
TA Al A A'AACA 



XXGMO TCCr TTGTCCGGAC ACATTTOCAA 
AACCAGATTC CAGCAGTCCC TGTCCATCAT 



TCCATGATGG AAGAOGTGGG GATGCAGGAT 
TGGMCTCTQG AAAGCCGAGC GCTAGGAOCT 
A'il ILi' l'lWl AGATGAAGAT GGAAAGGAST 
CTCCA' i'AAAC TC1AC 1 VUU A TAA A1 1 AW T 
Td^ ttAHV T AAGTATGCAT ACATCCAGOT 
CAGAGI 1 1UA GAGATCCCAC AACATCCGCC 



OCT 



80 

160 

240 

320 

400 

480 

SCO 

640 

720 

800 

880 

960 

1040 

1120 

1200 

1280 

1360 

1440 

1520 

1600 

16BO 

1760 

184 O 

1820 

2000 

208O 

2160 

2240 

2320 

2400 

2480 

2560* 

264 0 

2720 

2B0O 

2880 

2960 

3040 

3X20 

32O0 

3280 

3360 

3440 



WlU-'iaW GGCCGAGGTO GACATGATCA 



AGA i'l'JXIAGG CAA7TTGTGG AAGCTTOCOG TCAAGCC 



ggg ^^^ C CTOACAOCCA 3600 
TOAACCCCXT OGA GG I U U C C 3680 
^ACATGATCA AACTOCAOCT 3760 
rr CA'A'AG AT GATACAAACA 3840 

m 3920 



CGGGATGACC AGCTOGTCTT 



AACAACGTTA 
CAAAGTTTTC 
GTGTTAGAAT 
GGOGACCTTT 



TTTCTTAACA 
ATTGTGTCTT 



TT A A TC O UCA ' 
CATTTTTAAT 
TAATGGTTTA 



TGCCTOGACT 
AGGATTTGTG 
ACTGGCTTCA 



I 



50 



TTCITCTCAT 



60 



**GT* G*BQG AAATOGCGAA 4000 
ACCGAATTCC CTTCACATCT 4080 
C GG TCG A VAt» ATOkCAirxC 4X60 
AT«CTOGGO-AGACOGAGCA 4240 
GACi A-AUTAT AGGAGTTCXA 4320 
**C*A*GGTG TGGTAGACAC 4400 
<3*&A**** A AOGTTCTGQG 4480 
COIIKX UUAA ATCTGATCGT 4560 
CTAT AA1 XJVC ATTCAACAAC 4640 
AGTCAGAACT TTGAUIU'AAA 4720 
GCTX ATGCAT TAAGTTTAAT 4800 

AATATGQAGA. 4880 

I in i 4893 

I 70 | 80 
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I 1° 

HVMIAFIj 
TTUCPSA 
IANHSIAV 

*VRDVSPF 
5GNSUER 
CLHQFQW 
TEVCLTAIj 

jRX»IKHTSF 

-yvhvtmjv 
jipxyekr 
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I 20 

pflJTSNKULR 



CRjtKFLVGU* 
GJ&YXASMQE 



I 30 
1.TRATQEEVA 
YSWFFFPVLI 
KGFVTRQIKM 



CHSKLSSXRT 



ESIAUWNP 
HQQSST1CNS 
GliGPIVHDRK 

knql1abkgh 
easqu/stflm 
iktvu1ataq 



VNVTKVTIHV 

YISCFAPGDP 
FREVRLXAXS 
IiVTPQKGSTL 
WKCDKIjDQS 
SQTUVSRMR 
NP1/4KKVTDV 
RNNFDYTGKK 



| 20 



EYITKEFXLT PLSEISQRLL 
KRIWTEMPFT QTGKRQGGVB 
IJCLQIJaUOGS VSVQVKAGPL 
KEMATOUSEX MHEQICPLEE 
| 30 | 40 
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VW ^ E J1 SHL*Uk» "o 

TSSSSS 

m-FEYSFEF ££££5 320 

V^NliiraS S^^SVR HMJSRGSLXS 400 

DNSLHKDLU3 AISGIASPXT ™*^^r_ fiEYjrnpKTXS 4 80 
EZUUHGFI, Vl^SH^ ^S££2* 560 

^StRPQQ 720 
t^S^S SSSS* WI^SMARXH VWGDfcSEAA 800 

S£S£ ■»;; 

EQCRKKTXLT AXHCFTYV** "'—jlLoy VKACGQA1AV XX20 
AYARAP1J3DT ^Y^KV ^S^*^ 1 19 4 

KTSVLPHSLH IFIMSGTFT STWVHGMT5S 00 ( 8Q 
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I 10 
AATTGTAATA 
GAATTCGGCA 
ATGAAAAGCA 
CTCGTTGAAA 
GGTCTCGGCG 
GGGTAGATGG 
ATTTTXTCCA 
CATGCGATGG 
ACAGGAAGAA 
ACTTGAGGTC 
ACCAAATCCA 
CTTTGATCTA 
CTGCATCCTA 



J 20 
CGACTCACTA 
CGAGTTTTAC 
CCACCTGTTG 
CCCAAGTTCG 

AAccrrccrr 

AGGCAAGCCA 
GTACTGTCAG 
AAGGCCACGT 
GTCGCGGTTA 
ATA1TJTTAAG 
TGACCACGAT 
CTGATCAAAT 



I 30 
TAGGGCGAAT 
ACCATCACCA 
CTCACATTCT 
CTACTCCTGG 
CGGGCXATCT 
CTGCPGAAAA 
AAAACCGAAT 
GATGATCGCC 
ACGTGACTCG 
TACGCGTATA 




40 

TCGGTACCGG 
AAACCCAGAA 
TCC&TGTCAG 
CTTCCCCTCC 
TOG CTACCAA 
TTTCCACTCA 



I 50 | 60 

OCCC CClViV GAGGTCGACG 
Ti'lTA TCATG AGATTAAAAT 
CTCTGACAAC TCAAGTAAAG 




I eo 

GTATCGATAA GCTTGATATC 



TTCTTOCCCA 
GGTCATTATT 



CAAGCAGATC 
CTGTAGTGTG 
GACCTCCAGC 
GACAGCCCTC 
ATGACAGATA 



AGAACGCGAA 
AACAACTACA 



CTATGGCTCA 
GCGGAAACCG 
TCATAGpCTT 
TIAGCTGTTT 
CATTATATTC 



TCTOCCGATT 
GCRT TTSATA 
TTGTAAATAT 
GCTGTCTTCA 



TCTUal'i'JVr 
AGCCTTAGGA 
CTATCCTAAA 
CATGTGGTTC 
-MATGTTT5CC 



TTATGGTCOG GAAATTAAAT 



CGTTGAACTT 



CAGGAG7TCC GGGAGGTCCG TCTGATCGCC 



AATCCGCTGG 
TGCTTCTCCA 
ATAAGCACAG ATTCGGC5XAA 
CACATTGGGA AATTCCGTGG 
TCTTAAAGAG CATGTCTGAT 
ATATCTGAAG TCTGCCTT3CA 
AGTTCATGAT CGAAAGTCTC 
TAACTCTCTC 



GGATcmrrcA c ce.uu.vm 

TGACGCCGCA GAAGGGAAGC 
TATACAACCT CAACTCCAAA 



SATGCTTTOT 
CCAUL-JVCAG TACATGGGGA 
A GACATI T3CC lUA ' AIUAW 
ACTTTXAACC ACAGCTATGG 



GGCOSACCAT 



ATGT^TGCGO 



GGACATAATC 
AAATGTCTTC 
CTCTGTGTTA 



GCTCTGGACA 
TGAA AAAAGTTTTT 
GGTCCTTAAT 



C1U1VAGCCA GCTGATASCA GACGTTGTTG GCATTGGGGA 



AATgGCCACC GCCCAGATGA AGGAGCATGA GAACGACCCA 
ATOCCAGCAC GCCCGAGCTC AGGAAGACGT GGCTCGACAG 
GC AQCA ATGT GCTATCTCCA CGTAACAGCC CTAG1XSGCAG 



ACGAG GATGT GCTGA7T3GAG CTCCTTGAGC AGTGCGCAGA 
ATCXACAAAC TTATCATCCC CATTTATOAG AAGCGGAGGG 
CTCACACCGC TGTCGGAAAT TTCTCAGAGA 



ATCGCCTTCT TTGA OGAAAA AGAGT7GGAA GAAAGGAAAA 
TGftGATSCCA TTTAGGCAGA CCGGGAAGAG GCAGGGCGGG 
CCCTTATGTG AAGAAGCGCA TCCCTGTCAT 
ATjgACGAGA TGAGTAAGAA GGTCGCGGAG CTCCGGCAGC 
CAAACTCCAG GGCAG CGTGA GTCJTCAGGT CAATGCTGOC 
CAAAGCGA1A TCCTOACAAT AAAGTQAAGC TGCTTAAGGA 
GCGGTAAACG AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 

GGAGcrrrcr gaaatcatgc atgagcagat ctgccccctg 

TCAACGCCAT CAGTCGSACT CCAACAAGCA CAATCGTTCA 

atggcccgtg tgtcggoact tgctttctca tjtgcaaact 
cagggaggac caaggggaag gggagagaaa ggaaataaag 
asaaggtsca catottttt taaatctcac 1x3gcaatatt 

TCrPGAGCTG GACrrAGATT T lAri Ll lUJ TTGCAGAGTA 



AATCAGGGTA CAGAACTIAC TAGTTTTGTC TAGGAGTATG 
ATAGAGCAAG AATACTGAGC TAACTGAGCT AGACACTCAA 
TCATCGACTC CG0GA C9GTC ATATATGTAT TACATTTCIA 
TGTG ATAAAT TUGIUJIUUT CCAGTATaTO CAAXACACTT 



I 



10 



I 20 



I 30 



I 



40 



GAGAACTCCA 
GCTGATGCCA 



TGGGCAGGCA 
ACAGGGATAC 
AACGAACTTG TAAAGTACCT TAAGAGTCTC 
CCAGCTGTTC CGAGTCCTCA CCAGAGCC&C? 

CCCftG lOC CA 

TCTGAATACA 
CAACAAACTA 
AAGTTAAGTT 
CACATCACTC 
TTTCACCTTC 
CCCTCTTTGA 



CTSM5Gx ^Cr CATGGTTTTT 



XG&AGTTTGG AGATAATCCA 
GCXTTGTCTT 



GAAACCACTT CTTGGTGGGA 
ATCAGTCTGC TCAAGAACCT 
CACCCTCTAC CT B CC XCiW 
TCAACGCGGG CATGACCGTG 
W=CC1XSGAC^ ACAGCCTGCA 
CATCAACACT GTGAGAAATG 
GTGAGAAGAG CAATTOCCTG 
C^GTCTCAGA TXAAGAGCCT 



CTGTTACTGA 
GCTGATAAAG 



CTOGGCGCCA 



ACTGATGTOT TTCCTClAC3k 



S^SSS Trrrrt^S 



GAATGATGCA 
CC ACTC GGAC GCAGATGTTC 
OBClliUvi* ATTTACATTG 
GATGTCTACC - Tttiumwr 
TTATAAGTTT CCCT CA ACAT 



AAGAAG ItVJ' TTCTCCGGAC 
AACCAGA1TC CAGCAGTCCC 



CCTCCCAGCT 



TGTCCATCJttT CAACAACTGT 



^^TS TGGACCTCCA ^CAOCCTO OCCAAATCCT 



AATOICTCAC ACGGAAAGGC GTGTTXAGAC AAGGATOWC 



TOGACTCT OG AAAGCCGAGC 
AT A lll lTUA AGATCAAGAT 
CrcCITAAAC TGTACTOGGA 



TAAATTTGGT TCTGAAAATO 



CAGAG^^ GAGATCCCAC AACATCCOCC GCTTCATSXT 



80 

160 

240 

320 

400 

480 

560 

640 

720 

800 

880 

960 

104 O 

112 O 

1200 

12 8 O 

1360 

144 O 

152 O 

16QO 

1680 

1760 

184C 

1520 

2000 

2080 

2160 

2240 

2320 

24 OO 

2480 

2560' 

2640 

2720 

2800 

2880 

2960 

3O40 

3120 

3200 

3280 

3360 

3440 



AGTTTTCAGG CAATTTGTGG 
AATGAAAGCC 



ATGCGCGAGC TTTCTTAGAT GATACAAACA 



AGCTCGTCT 



AACA ACGTT A TTTCTTAACA 
CAAAUl iJ-IU ATTGTGTCTT 
G i Is'iTAGAAT AGATGGCCTA 
TT 



CGGTCGTOTO 
ATCACTCGGG 

GACTTTCTAT 



AGGTTCTGGG 



1TOTATGACT AGGATTTGTG CXATXATCTC ATTCAACAAC 



CAT 'lU'lTAA T ACTCACATGG 
TAATGG1IUA TTCTTGTCAT 



! 



50 



I 



60 



I 70 



I 80 



3600 
3680 
3760 
3840 
3920 
4000 
4080 
4160 
4240 
4320 
4400 
4460 
4560 
4640 
4720 
4800 
4880 
4898 
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| 10 
HVMIXFT. 

VSLTDEF 
fVTOVSPF 

XHQFQVM 

KL1WTSF 
YVHVTALV 

DEKEX>QER 



t 20 
pXllNQLFTCV 

fXOT.CTTFMD 

pVNAGMTVJO) 

CNSKLSSXKT 
SS0VKDWKR 

leTEFERSHHI 
QfcCSSAKVEH 

| 30 



| 30 
liTRATQEEVA 
YSWFTFDVLl 
RGFVFKQ2OT 

HQQSSTLGNS 
GUGPXVKDRK 
JCNQLLADHGH 

IKTVXMATAQ 
RQGCTAFKVI 

RRFMTTSMPFT 



| 30 



VHVTKVIIHV 
KSMAQHLIEN 

FBTVULIAIS 
IOTPQKGSTL 

sornjpvsRMR 

RKNFDYTGXK 

TFNXDEEfcSM 

QrrGlCRQGGVE 
V5V0VKA6FL 
KHEQICPLEE 
| 40 



1 50 
VAQCHEEGXX 



KKSLlHCFLi 
TGMMHMUOQ 

KLYSDKFGSE 
EQCKRKXIL.T 

KTSVliPNSUJ 
| 50 



i co I 70 r " 

FPASYHKAWE TWHMLMFHX TQJTCCWPER 

^wo*m£ iPLKUPHPro imowoW; 

^^OT QMOWMn* i«UAEHVQ 

VTLXSMSDDA LPlWHiaST SEXH0FFTIS 
KHSV^^ V^^H 

ctalkhvf» uisunrrrps ™Whc 

MUM* VGJCETWQQ 

syastfei** wuwuaaa vkwcto-sk^ 

SeII EQCMXSIMKA KRY^MTY 
flVOTlQDSGK VKPTOLDSW MaQVlWXF 
^PHIJTYVTK R1PVKYQHHT 
^rTOHKV ^^^ ^^^ 

| €0 I 70 I *™ 
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960 
1040 
1120 
1194 
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81 
161 
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1041 
1121 
1201 
1281 
1361 
1441 
1521 
1601 
1681 
1761 
1841 



I 10 
AATTGTAATA 
GAATTCGGCA 
ATGAAAAGCA 
GTCGTTGAAA 
GGTCTCGGCG 
GGGTAGATGG 
ATTTTTTCCA 
CATGCGATGG 
ACAGGAAGAA 
ACTTGAGGTC 
ACCAAATCCA 
CTTTGATGTA 
CTGCATCCTA 
GAGGCATCTA 
CAAGCAGATC 
GTGTAGTGTG 
GACCTCCAGC 
GACAGCCCTC 
ATGACAGATA 
GTCCAGCGGA 
ACCAGCTGTG 
TCTCCGGCAT 
ATAAGCACAG 
CACATTGGGA 
152 1 r TCTTAAAGAG 
2 0 O&ATATCTGAAG 
2 0 8g AGTTCATGAT 
2 16,3c: GCAGCCTGGA 
2 2 4M rT GCCAACATTG 
23230 GGCCGACCAT 
24Cfc|.= CX3GCTTTAAA 
24 Bjg ATGTGTGCGG 
25 GCTCTACTTC 
264JL| CT6TCAGCCA 
27?X GCCAACAGTG 
280£ AATGGCCACC 
2881 ATGCCAGCAC 
GCAGCAATGT 
CGCCTTCAGG 
ACGAGGATGT 
^ ATCTACAAAC 
3281 GGAACCCAAA 
33fel TCAAAATGAT 
34*1 ATCCCCTTCT 
35^J TGAGATGCCA 
TACACTGCTT 
ATTGACGAGA 
CAAACTCCAG 
CAAAGCGATA 
GCGGTAAACG 
GGAGCTTTCT 
TCAACGCCAT 
ATGGCCCGTG 
CAGGGAGGAC 
AGAAGGTGCA 
TCTTGAGCTG 
481 ATCTACATGG 
561 AATCAGGGTA 
ATAGAGCAAG 
TCATCGACTC 
TGTGATAAAT 
TGTATACAAG 
I 10 



29£1 
3 041 
31@ 
32.01 



3601 
3681 
3761 
3841 
3921 
001 
081 
161 
241 
321 
401 



641 
721 
801 
881 



I 20 | 30 | 40 

CGACTCACTA TAGGGCGAAT TGGGTACCGG 
CGAGTTTTAC ACCATCACCA AAACCCAGAA 
CCACCTGTTG CTCACATTCT TCCATGTCAG 
CCC AAG TTGG CTACTCCTGG CTTCCCCTCC 
AACCTTCCTT CGGGCTATCT TGGCTACCAA 
AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
AAGGCCACGT GATGATCGCC TTCTTGCGCA 
GTCGCGGTTA ACGTGACTCG GGTCATTATT 
ATATGTTAAG TACGCGTATA AGGCTGAGCC 
TGACCACGAT TCTCAAGCCT TCTGCCGATT 
CTGATCAAAT ■ CTATGGCTCA GCATTTGATA 
TCATCATGCA GCGGAAACCG TTGTAAATAT 
AGAACGCGAA TCATAGCCTT GCTGTCTTCA 
AACAACTACA TTAGCTGTTT TGCTCCTGGA 
CAACCATGAA CATTATATTC CGTTGAACTT 
TTGACTACTC ATTAACAGAT GAGTTCTGCA 
CA GGA GTTCC GGGAGGTCCG TCTGATCGCC 
TGCTTCAAGG AGCCATCAGG CAAGGATAGC 
TCAATGTGAG GGATGTGTCA CCCTTCCCTQ 
AATCOGCTGG TGACGCCGCA GAAGGGAAGC 
TGCTTCTCCA TATACAACCT CAACTCCAAA 
ATTCGGGTAA CAGCCTTCCA GAAAGGAATA 
AATTCCGTOG TTCGC1X3TGA TAAACTTGAC 
CATGTCTGAT GATGCTTTGT TTACATATTG 
TCTGCCTGCA CCAGTTCCAG TACATGGGGA 
CGAAAGTCTC AGACATTGCC TGTTTCCCGT 
TAACTCTCTC ACTTTTAACC ACAGCTATGG 
CTACTGAGGT TTGCCTGACA GCTCTGGACA 
GGACATAATC CTCTCATGAA AAAAGTTTTT 
AAATGTCTTC ACTGCCTTAA GGTCCTTAAT 
CTCTGTGTTA CGAGATTCTC AAGTGCTGTA 
CTGATGAGGA ACAACT7TGA TTACACTGGA 
GCTGATAGCA GACGTTGTTG GCATTGGGGA 
ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
GCCCAGATGA AQGAGCATGA GAACGACCCA 
GCCCGAGCTC AGGAAGACGT GGCTCGACAG 
GCTATGTCCA CGTAACAGCC CTAGTGGCAG 
GTCATTACCC CAAACATCGA CGAGGAGGCC 
GCTGATGGAG CTCCTTGAGC AGTGCGCAGA 
TTATCATCCC CATTTATGAG AAGCGGAGGG 
CTCACACCGC TGTCGGAAAT TTCTCAGAGA 
ACA GGATTCT GGCAAGGTCA ACCCTAAGGA 
TTG ACGAAAA AGAGTTGCAA GAAAGGAAAA 
TTTACGCAGA CCGGGAAGAG GCAGGGCGGG 
CCCTTATGTG AAGAAGCGCA TCCCTGTCAT 
TGAGTAAGAA GGTGGCGGAG CTCCGGCAGC 
GGCAGCGTGA GTGTTCAGGT CAATGCTGGC 
TCCTGACAAT AAAGTGAAGC TGCTTAAGGA 
AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
GAAATCATGC ATGAGCAGAT CTGCCCCCTG 
CAGTGGGACT CCAACAAGCA CAATGGTTCA 
TGTGGGGACT TGCTTTGTCA TTTGCAAACT 
CAAGGGGAAG GGGAGAGAAA GGAAATAAAG 
CATATTTTTT TAAATCTCAC TGGCAATATT 
GACTTAGATT TTATTCTTCC TTGCAGAGTA 
CAGGGAGGGC TGCACTGACA TTGATGCCTG 
CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
AATAGTGAGC TAACTGAGCT AGACACTCAA 
CGGGA OGGTC ATATATGTAT TACATTTCTA 
TTGTGCTGGT CCAGTATA7G CAATACACTT 
TCTTTACT 

I 20 I 30 | 40 



| 50 
GCCCCCCCTC 
TTTTATGATG 
CTGTGACAAC 
TGAAAGACGG 
GAGCTTGGGA 
TCTGGTTTCT 
AGCCTTAGGA 
CTATCCTAAA 
CATGTGGTTG 
. ATATGTTGCC 
TCCTCACCAG 
GAGAACTCCA 
GCTGATGCCA 
TCAAGAGATG 
GACCCAAAGA 
ACCAATGCCA 
GAAACCACTT 
ATCAGTGTGC 
CACCCTCTAC 
TGAACGCGGG 
ACCCTGGACA 
CATCAACAGT 
GTGAGAAGAG 
CAGTCTGAGA 
GAACAAGGCT 
AGCGATACAT 
AACAGAACAG 
CCACTCGGAC 
CGCTTTCTCT 
GATGTCTACC 
TTATAAGTTT 
ACTCCAAGCT 
>AAGAAGTCCT 
AACCAGATTC 
CTGATGTGAA 
GAGATGCTGG 
CATGGCCAGG 
AATATCTCAC 
TCCATGATGG 
TGGACTCTGG 
ATTTCTTTGA 
CTCCTTAAAC 
TCTGGATTCT 
CAGAGTXTGA 
GTGGAAGAGC 
6TACCAGCAC 
TGTGCTCCTC 
CCACTAGCAT 
AGTTTTCAGG 
ATCAGGAAGA 
GAGGAGAAGA 
CGGGATCACC 
CAGGATGCTT 
AACAACGTTA 
CAAAGTTTTC 
GTGTTAGAAT 
GGGGACCTTT 
TTGTATGACT 
TTAATCCGCT 
CATTTTTAAT 
TAATGGTTTA 



I 



50 



| 60 
GAGGTCGACG 
AGATTAAAAT 
TCAAGTAAAG 
AAGGGTGGTG 
TGGGCAGGCA 
ACAGGGATAC 
AACGAACTTG 
CCAGCTGTTC 
CCCAGTGCCA 
TCTGAATACA 
CAACAAACTA 
AAGTTAAGTT 
CACATCACTC 
TTTCACCTTC 
CCCrCTTTGA 
TTTGGAAAAG 
CTTGGTGGGA 
TCAAGAACCT 
CTGCCTCTGT 
CATGACCGTG 
ACAGCCTGCA 
GTGAGAAATG 
CAATTCCCTG 
TTAAGAGCCT 
TCAACATCTG 
A6CCAGGAAC 
GAATGATGCA 
GCAGATGTTC 
ATTTACATTG 
•TGTGTTTTCT 
CCCTCAACAT 
GAGCTCCATC 
TTGTCCGGAC 
CAGCAGTCCC 
GGACTTAACC 
TGGACCTCCA 
ATCCATGTCA 
ACGGAAAGGC 
AAGACGTOGG 
AAAOCCGAGC 
AGATGAAGAT 
TGTACTCGGA 
AAGTATGCAT 
GAGATCCCAC 
AGTGCAAACG 
CACACTGACC 
GGCCGAGGTG 
ATGCGCGAGC 
GAATTTGTGG 
AATGAAAGCC 
CGAGCGTCTT 
AGCTCGTCTT 
TCCAAAGCCA 
TTTCTTAACA 
ATTGTGTCTT 
AGATGGCCTA 
T6CCTCGACT 
AGGATTTGTG 
ACTGGCTTCA 
ACTCACATGG 
TTCTTGTCAT 

I 60 



| 70 
GTATCGATAA 
AGAGTTGCCC 
GAAGCACGAA 
ACAAGCGAGC 
TTATGGTCCG 
TCAGGATCAG 
TAAAGTACCT 
CGAGTCCTCA 
TGAGGAAGGA 
AGACAGTGCA 
CTGAGGTACT 
GCTGCGAAAC 
AGAAGTTTGG 
ATGGACAGGG 
ATACAAGTTT 
GCAGGATTCA 
CTGTTACTQA 
GCTGATAAAG 
TTGGTCTGCT 
AAGGATGAAT 
CAAGGACCTG 
CTGATTCGAG 
GATAAGCACC 
ACTGATGTGT 
AACTTATGGA 
CAGGAGGGGT 
TGCCAGATTG 
TGCACCAGTC 
GCGTTTAAGA 
TCAAAAACAT 
TCTATGAAGG 
AGGACGGAGG 
ACATTTGCAA 
TGTCCATCAT 

AAAAGGATAC 
GTACAGCCTG 
AAAATGGCGA 
GTGTTTAGAC 
GATGCAGQAT 
GCTACGAGGT 
GGAAAGGAGT 
TAAATTTGGT 
ACATCCAGGT 
AACATCCQCC 
GCGCACCATC 
TGAACCCCAT 
GACATGATCA 
TTTCTTAGAT 
AAGC7TGOGG 
AACTACAQGG 
ACOSAATTCC 
CGGTCGTGTG 
ATCACTGGGG 
GACTTTCTAT 
AACAAAGGTG 
CAGAAAAAAA 
CGTGCCGGAA 
CTATTATCTC 
AGTCAGAACT 
GCTTATGCAT 
AAAAATGTGC 



I 80 
GCTTGATATC 
ACTCAGCTGC 
GAAGAGGGAT 
AGCACATCCC 
GAAATTAAAT 
CATTTACATA 
TAAGAGTCTG 
CCAGAGCCAC 
TTGGAGAGCC 
TGAAGAACTG 
CATGGTTTTT 
CAGAGATJTC 
AGATAATCCA 
GCTTTGTCTT 
GAATTTCrcC 
AAGATACCAA 
GGGAGGTGGG 
CATTCTTTTC3 
GATTGAAAAC 
CCCTGGCTCT 
CTGGGCGCCA 
AGGATCTCTC 
AACAAAGTAG 
TTCCTCTACA 
TTITTTTACA 
TGGGACCCAT 
CAGCAGCTGG 
ATTACTTGAA 
ACCAGCTCCT 
CAGTCTGAAA 
GAGAGCGGAC 
CCTCCCAGCT 
GTCATCATAT 
CAACAACTGT 
GCACGGTGCT 
GCCAAATCCT 
TCTCTCAGAG 
AAGGATGCAC 
GTCCATTTCA 



ATATTTACAA 
TCTGAAAATG 
GACTCACGTC 



CTGACAGCCA 
CGAGGTGGCC 
AACTGCAGCT 
GATACAAACA 
TCAAGCCTTA 
AAATGGCGAA 
CTTCACATCT 
ATTACATCTC 
AGACCGAGCA 
AGGAGTTOTA 
TGGTAGACAC 



ATCTGATCGT 
ATTCAACAAC 
TTGTCATTAA 
TAAGTTTAAT 
AATATGGAGA 



I 



70 



I 



80 

160 

240 

320 

400 

460 . 

560 

640 

720 

8O0 

880 

960 

1O40 

1120 

1200 

1280 

1360 

1440 

1520 

1600 

1680 

1760 

1640. 

1920 

2000 

2080 

2160 

2240 

2320 

2400 

2480 

2560 

2640 

2720 

2800 

2880 

2960 

3040 

3120 

3200 

3280 

3360 

3440 

3520 

3600 
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3760 

3840 

3 920 
4000 

4 080 
4160 
4240 
4320 
4400 
4480 
4560 
4640 
4720 
4800 
4860 
4898 



80 



4 



1 
81 
161 
241 
321 
401 
481 
561 
641 
721 
801 
881 
961 
1041 
1121 



1 10 I 20 

MEGHVMIAFL PTII^QLFRV 
SMTTIL.KPSA PFLTSNKLU* 
SKNANHS1AV FIKRCFTFMD 
QLDYSLTDEF CRKHFLVGLL 
RINVRDVSPF FVNAGMTVKD 
TDSGNSLPER NSEKSNSIJDK 
EVCLHQFQYM GKRYIARNQE 
1ATEVCLTAL, DTLSUTLAF 
AALCYEILKC CNSKLSSIRT 
SDRLIKHTSF SSDVKDLTKR 
MCYVHVTALV AEYLTRKGVF 
KLIIPIYEKR RDFFEDEDGK 
FFDEKELQER KTEFERSHNI 
EMSKKVAEIjR QLCSSAEVDM 
NERLIKEDQL EYQEEMKAHY 
| 10 | 20 



| 30 
LTRATQEEVA 
YSWFFFDVLI 
RGFVFKQINN 
LREVGTALQE 
ESIALPAVNP 
HQQSSTU5NS 
GLGPIVHDRK 
KNQLLADHGH 
EASQIiYFLM 
IRTVLMATAQ 
RQGCTAFRVT 
EYIYKEPKLT 
RRFMFEMPFT 
IKLQLKLQGS 
REMAKELSEI 
| 30 



| 40 
VNVTRVIIHV 
KSMAQHLXEN 
YISCFAPGDP 
FREVRLIAIS 
LVTPQKGSTL 
WRCDKLDQS 
SQTLPVSRNR 
NP1/1KKVFDV 
RNNFDYTGKK 
MKEHENDPEM 
TPNIDEEASM 
PLSEISQRIi 
QTGKRQGGVE 
VSVQVNAGPL 
MHEQI CPLEE 
| 40 



| 50 
VAQCHEEGLE 
SKVKLLRNQR 
KTLFEYKFEF 
VLKNLLIKHS 
DNSLHKDLLG 
EIKSLLWCFL 
TGMMHARLQQ 
YLCFLQKHQS 
SFVRTHliQVI 
LVDLQYS1AK 
MEDVGMQDVH 
KLYSDKFGSE 
EQCKRRTILT 
AYARAFLDDT 
KTSVLPNSLH 
| 50 



| 60 
SHLRSYVKYA 
FPASYHHAAE 
LRWCNHEHY 
FDDRYASRSH 
AISGIASPYT 



YILKSMSDDA 
LGS3-J3NSLTF 
ETALKNVFTA 
ISVSQLXADV 
SYASTPEURK 
FNEDVLMELL 
NVKMIQDSGK 
AIHCFPYVKK 
NTKRYPDNKV 
IFNAISGTPT 

I 60 



| 70 
YKAEFYVASE 
TWNMLMPHI 
IPl^NLPMPFG 
QARXATLYLP 
TSTPNINSVR 
LFTYWNKAST 
NHSYGHSDAD 
LRSUCYKFPS 
VGIGETRFQQ 
TWUDSMARIH 
EQCADGLWKA 
VNPKDIjDSKY 
RIPVMYQHHT 
KLLKEVFRQF 
STMVHGMTSS 
| 70 



| 80 
YKTVHEELTK 80 
TQKFGDNPEA 160 
KGRXQRYQDIi 240 
LFGLUENVQ 320 
NADSRGSLIS 40O 
SEUtDFFTIS 4 80 
VTjHQSUJBAK 560 
TFYEGRADMC 640 
S1*SIINNCAN 720 
VKNGDLSEAA 800 
ERYELXADIY 880 
AYIQVTHVIP 960 
DLNPIEVAID 1040 
VEACGQA1AV 1120 
SSW 1194 
| 80 



I 10 

1 AATTGTAATA 
81 GAATTCGGCA 
161 ATGAAAAGCA 
241 GTCGTTGAAA 
321 GGTCTCGGCG 
401 GGGTAGATGG 
4iBl ATTTTTTCCA 
561 CATGCGATGG 
641 ACAGGAAGAA 
721 ACTTGAGGTC 
801 ACCAAATCCA 
881 CTTTGATGTA 
961 CTGCATCCTA 
1041 GAGGCATCTA 
1121 CAAGCAGATC 
1201 GTGTAGTGTG 
1281 GACCTCCAGC 
1361 GACAGCCCTC 
1441 ATGACAGATA 
1521 GTCCAGCGGA 
1601 ACCAGCTGTG 
1681 TCTCCGGCAT 
1761 ATAAGCACAG 
1841 CACATTGGGA 
1 9 2|^ TCTTAAAGAG 
2 0 O^J ATATCTGAAG 
2 0 Bg% AGTTCATGAT 
216^: GCAGCCTGGA 
2244. GCCAACATTG 
232£1 GGCCGACCAT 
24 OjQ CGGCTTTAAA 
24|jr- ATGTGTGCGG 
256§ GCTCTACTTC 
264i CTGTCAGCCA 
272^ GCCAACAGTG 
280$ AATGGCCACC 
28fl ATGCCAGCAC 
29£1 GCAGCAATGT 
3041 CGCCTTCAGG 
31|p. ACGAGGATOT 
32^01 ATCTACAAAC 
32ll GGAACCCAAA 
32|fii. TCAAAATGAT 
3*4-1 ATCCCCTTCT 
34:2:1 TGAGATCCCA 
36jgL TACACTGCTT 
3681 ATTGACGAGA 
3761 CAAACTCCAG 
3841 CAAAGCGATA 
3921 GCGGTAAACG 
4001 GGAGCTTTCT 
4 OBI TCAACGCCAT 
4161 ATGGCCCGTG 
4241 CAGGGAGGAC 
4321 AGAAGGTGCA 
4401 TCTTGAGCTG 
4481 ATCTACATGG 
4561 AATCAGGGTA 
4641 ATAGAGCAAG 
4721 TCATCGACTC 
4801 TGTGATAAAT 
4881 TGTATACAAG 
I 10 



I 20 I 30 I 40 

CGACTCACTA TAGGGCGAAT TGGGTACCGG 
CGAGTTTTAC ACCATCACCA AAACCCAGAA 
CCACCTGTTG CTCACATTCT TCCATGTCAG 
CCCAAGTTGG CTACTCCTGG CTTCCCCTCC 
AACCTTCCTT CGGGCTATCT TGGCTACCAA 
AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
AAGGCCACGT GATGATCGCC TTCTTGCCCA 
GTCGOGGTTA ACGTCACTCG GGTCATTATT 
ATATGTTAAG TACGCGTATA AGGCTGAGCC 
TGACCACGAT TCTCAAGCCT TCTGCCGATT 
CTGATCAAAT CTATGGCTCA GCATTTGATA 
TCATCATGCA GCGGAAACC6 TTGTAAATAT 
AGAACGCGAA TCATAGCCTT GCTGTCTTCA 
AACAACTACA TTAGCTGTTT TGCTCCTGGA 
CAACCATGAA CATTATATTC CGTTGAACTT 
TTGACTACTC ATTAACAGAT GAGTTCTGCA 
CAGGAGTTCC GGGAGGTCCG TCTGATCGCC 
TGCTTCAAGG AGCCATCAGG CAAGGATAGC 
TCAATGTGAG G6ATGTGTCA CCCTTCCCTG 
AA TCCGCT GG TGACGCCGCA GAAGGGAAGC 
TGCTTCTCCA TATACAACCT CAACTCCAAA 
ATTCGGGTAA CAGCCTTCCA GAAAGGAATA 
AATTCCGTGG TTCGCTGTGA TAAACTTGAC 
CATGTCTGAT GATGCTTTGT TTACATATTG 
TCTGCCTGCA CCAGTTCCAG TACATOGGGA 
CGAAAGTCTC AGACATTGCC TGTTTCCCGT 
TAACTCTCTC ACTTTTAACC ACAGCTATGG 
CTACTGAGGT TTGCCTGACA GCTCTGGACA 
GGACATAATC CTCTCATGAA AAAAGITTTT 
AAATGTCTTC ACTGCCTTAA GGTCCTTAAT 
CTCTGTGTTA CGAGATTCTC AAGTGCTGTA 
CTGATGAGGA ACAACTTTGA TTACACTGGA 
GCTGATAGCA GACGTTGTTG GCATTGGGGA 
ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
GCCCAGATGA AGGAGCATGA GAACGACCCA 
GCCCGAGCTC AGGAAGACGT GGCTCGACAG 
GCTATGTCCA CGTAACAGCC CTAGTGGCAG 
GTCATTACCC CAAACATCGA CGAGGAGGCC 
GCTGATGGAG CTCCTTGAGC AGTGCGCAGA 
TTATCATCCC CATTTATGAG AAGCGGAGGG 
CTCACACCGC TGTCGGAAAT TTCTCAGAGA 
ACAGGATTCT GGCAAGGTCA ACCCTAAGGA 
TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
TTTACGCAGA CCGGGAAGAG GCAGGGCGGG 
CCCTTATGTG AAGAAGCGCA TCCCTGTCAT 
TGAGTAAGAA GGTGGCGGAG CTCCGGCAGC 
GGCAGCGTGA GTGTTCAGGT CAATGCTGGC 
TCCTGACAAT AAAGTGAAGC TGCTTAAGGA 
AACGTCTGAT TAAAGAAGAC CAGCPCGAGT 
GAAATCATGC ATGAGCAGAT CTGCCCCCTG 
CAGTGGGACT CCAACAAGCA CAATGGTTCA 
TGTGGGGACT TGCTTTGTCA TTTGCAAACT 
CAAGGGGAAG GGGAGAGAAA GGAAATAAAG 
CATATTTTTT TAAATCTCAC TGGCAATATT 
GACTTAGATT TTATTCTTCC TTGCAGAGTA 
CAGGGAGGGC TGCACTGACA TTGATGGCTG 
CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
AATAGTGAGC TAACIX3AGCT AGACACTCAA 
CGGGACGG TC ATATATGTAT TACATTTCTA 
TTGTGCTGGT CCAGTATATG CAATACACTT 
TCTTTACT 

I 20 I 30 I 40 



I 50 
GCCCCCCCTC 
TTTTATGATG 
CTGTGACAAC 
TGAAAGACGG 
GAGCTTGGGA 
TCTGGTTTCT 
AGCCTTAGGA 
CTATCCTAAA 
CATGTGGTTG 
ATATGTTGCC 
TCCTCACCAG 
GAGAACTCCA 
GCTGATGCCA 
TCAAGAGATG 
GACCCAAAGA 
ACCAATGCCA 
GAAACCACTT 
ATCAGTGTGC 
CACCCTCTAC 
TGAACGCGGG 
ACCCTGGACA 
CATCAACAGT 
GTGAGAAGAG 
CAGTCTGAGA 
GAACAAGGCT 
AGCGATACAT 
AACAGAACAG 
CCACTCGGAC 
CGCTTTCTCT 
GATGTCTACC 
TTATAAGTTT 
ACTCCAAGCT 
^AAGAAGTCCT 
AACCAGATTC 
CTGATGTGAA 
GAGATGCTGG 
CATGGCCAGG 
AATATCTCAC 
TCCATGATGG 
TGGACTCTGG 
ATTTCTTTGA 
CTCC TTAAAC 
TCTGGATTCT 
CAGAGTTTGA 
GTGGAAGAGC 
GTACCAGCAC 
TGTGCTCCTC 
CCACTAGCAT 
AGTTTTCAGG 
ATCAGGAAGA 
GAGGAGAAGA 
CGGGATGACC 
CAGGATGCTT 
AACAACGTTA 
CAAAGTTTrC 
GTGTTAGAAT 
GGGGACCTTT 
TTGTATGACT 
TTAATCCGCT 
CATTTTTAAT 
TAATGGTTTA 



I 



50 



I 60 
GAGGTCGACG 
AGATTAAAAT 
TCAAGTAAAG 
AAGGGTGGTG 
TGGGCAGGCA 
ACAGGGATAC 
AACGAACTTG 
CCAGCTGTTC 
CCCAGTGCCA 
TCTGAATACA 
CAACAAACTA 
AAGTTAAGTT 
CACATCACTC 
TTTCACCTTC 
CCCTCTTTGA 
TTTGGAAAAG 
CTTGGTGGGA 
TCAAGAACCT 
CTGCCTCTGT 
CATCACCGTC 
ACAGCCTGCA 
GTGAGAAATG 
CAATTCCCTG 
TTAAGAGCCT 
TCAAGATCTG 
AGCCAGGAAC 
GAATGATGCA 
GCAGATGTTC 
ATTTACATTG 
. TOIUI ' ITTL ' T 
CCCTCAACAT 
GAGCTCCATC 
TTGTCCGGAC 
CAGCAGTCCC 
GGACTTAACC 
TGGACCTCCA 
ATCCATGTCA 
ACGGAAAGGC 
AAGACGTGGG 
AAAGCCGAGC 
AGATGAAGAT 
TGTACTCGGA 
AAGTATGCAT 
GAGATCCCAC 
AGTGCAAACG 
CACACTGACC 
GGCCGAGGTC 
ATGCGCGAGC 
CAATTTGTGG 
AATGAAAGCC 
CGAGOGTCTT 
AGCTCGTCTT 
TCCAAAGCCA 
TTTCTTAACA 
ATTGTGTCTT 
AGATGGCCTA 
TGCCTCGACT 
AGGATTTGTG 
ACTGGCTTCA 
ACTCACATGG 
TTCTTOTCAT 

I 60 



I 70 
GTATCGATAA 
AGAGTTGCCC 
GAAGCACGAA 
ACAAGCGAGC 
TTATGGTCCG 
TCAGGATCAG 
TAAAGTACCT 
CGAGTCCTCA 
TGAGGAAGGA 
AGACAGTGCA 
CTGAGGTACT 
GCTGCGAAAC 
AGAAGTTTGG 
ATGGACAGGG 
ATACAAGTTT 
GCAGGATTCA 
CTGTTACTGA 
GCTGATAAAG 

AAGGATGAAT 
CAAGGACCTG 
CTGATTCGAG 
GATAAGCACC 
ACTGATGTGT 
AACTTATGGA 
CAGGAGGGGT 
TGCCAGATTG 
TGCACCAGTC 
GCGTTTAAGA 
TCAAAAACAT 
TCTATGAAGG 
AGGACGGAGG 
ACATTTGCAA 
TGTCCATCAT 
AAAAGGATAC 
GTACAGCCTG 
AAAATGGCGA 
GTGTTTAGAC 
GATGCAGGAT 
GCTACGAGCT 
GGAAAGGAGT 
TAAATTTGGT 
ACATCCAGGT 
AACATCOGCC 
GCGCACCATC 
TGAACCCCAT 
GACATGATCA 
TTTCTTAGAT 
AAGCTTGCGG 
AACTACAGGG 
ACCGAATTCC 
CGGTCGTGTG 
ATCACTGGGG 
GACTTTCTAT 
AACAAAGGTG 
CAGAAAAAAA 
CGTGCCGGAA 
CTATTATCTC 
AGTCAGAACT 
GCTTATGCAT 
AAAAATGTGC 



I 



70 



I 80. 
GCTTGATATC 80 
ACTCAGCTOC 160 
GAAGAGGGAT 240 
AGCACATCCC 320 
GAAATTAAAT 400 
CATTTACATA 4 80 . 
TAAGAGTCIX3 560 
CCAGAGCCAC 640 
TTGGAGAGCC 720 
TGAAGAACTG 800 
CATGGTTTTT 880 
CAGAGA7TTC 960 
AGATAATCCA 1040 
GCTTTGTCTT 1120 
GAATTTCTCC 1200 
AAGATACCAA 1280 
GGGAGGTGGG 1360 
CATTCTTITG 1440 
GATTGAAAAC 1520 
CCCTGGCTCT 1600 
CTQGGCGCCA 1680 
AGGATCTCTC 1760 
AACAAAGTAG 1840, 
TTCCTCTACA 1920 
TTTTTTTACA 2000 
TCGGACCCAT 2080 
CAGCAGCTGG 2160 
ATTACTTGAA 2240 
ACCAGCTCCT 2320 
CAGTCTGAAA 2400 
GAGAGCGGAC 24 80 
CCTCCCAGCT 2560 
GTCATCATAT 2640 
CAACAACTC3T 2720 
GCACGGTGCT 2800 
GCCAAATCCT 2880 
TCTCTCAGAG 2960 
AAGGATGCAC 3040 
GTCCATTTCA 3120 
CATCGCOGAC 3200 
ATAT TTACAA 3280 
TCTGAAAATG 3360 
GACTCAOGTC 3440 
GCTTCATGTT 3520 
CTGACAGCCA 3600 
CGAGGTGGCC 3680 
AACTGCAGCT 3760 
GATACAAACA 384 0 
TCAAGCCTTA 3920 
AAAT GGCGAA 4000 
CTTCACATCT 4080 
ATTACATCTC 4160 
AGACCGAGCA 4240 
AGGAGTTGTA 4320 
TGGTAGACAC 4400 
AGGTTCTGGG 4480 
ATCTGATOGT 4560 
ATTCAACAAC 4 640 
TTGTCATTAA 4720 
TAAGTTTAAT 4 800 
AATATGGAGA 4 880 
4698 



I 



80 



I 1° 

1 MEGHVMI AFL 
81 SMTTIL.KPSA 
161 SKNANHSIAV 
241 QLDY SLTDEF 
321 RINVKDVSPF 
4 01 TDSGNSLPER 
4 81 EVCLMQFQYM 
561 IATEVCLTAL 
641 AALCYEILKC 
721 SDRLIKHTSF 
801 MCWHVTALV 
881 KL.I I PI YEKR 
961 FFDEKELQER 
1041 EMSKKVAEluR 
1121 NERX.IKEDQL 
I 1° 



| 20 
PTILNQ1*FRV 
DFLTSNICLLR 
FIKRCFTFMD 
CRNHFLVGLli 
PVNAGWTVTO) 
NSEKSNSLJ5K 
GKRYXARNQE 
DTLSLFTLAF 
CNSKL.SSIRT 
SSDVKDLTKR 
AEYIjTRKGVF 
RDFFEDEDGK 
KTEFERSHKI 
QLCSSAEVDM 
EYQEEMKANY 
| 20 



| 30 
LTRATQEEVA 
YSWFFFDVLI 
RGFVFKQINN 
LREVGTALiQE 
ESLALPAVNP 
HQQSSTLGNS 
GLGPIVHDRK 
KNQLLADHGH 
EASQIiYFU* 
IRTVU4ATAQ 
RQGCTAFRVX 
EYIYKEPKLT 
RRFMFEMPFT 
IKLQUOjOGS 
REMAKELSEI 



I 



| 40 
VNVTRVTIHV 
KSMAQHLIEN 
YISCFAPGDP 
FREVRLIAIS 
LVTPQKGSTL 
WRCDKLDQS 
SQTLPVSRNR 
NPIJ4KKVFDV 
RNNFDYTGKX 
MKEHENDPEM 
TPNIDEEASM 
PLSE1 SQRLL 
QTGKRQGGVE 
VSVQVNAGPL 
MHEQICPLEE 
30 | 40 



| 50 
VAQCHEEGLE 
SKVKIJjRNQR 
KTLFEYKFEF 
VLKNliLIKHS 
DNSLHKDULG 
EIKSLLWCFL. 
TGMMHARLQQ 
YLCFLQKHQS 
SFVRTHliQVT 
LVDLQYSLAK 
MEDVGMQDVH 
KLYSDKFGSE 
EQCKRRTIIjT 
AYARAFLDDT 
KTSVLPNSL« 
| 50 



| 60 
SHL*RSYVKYA 
FPASYHHAAE 
UWVCNHEHY 
FDDRYASRSH 
AISGIASPYT 
YILKSMSDDA 
LGSLDNSLTF 
ETALKNVFTA 
I SVSQIjIAPV 
SYASTPE1JIK 
FNEDVLiMELL 
NVKMIQDSGK 
AIHCFPYVKR 
OTKRYPDNKV 
IFKAISGTPT 
| 60 



| 70 
YKAEPYVASE 
TWNMLWPHI 
IPUNLPMPFG 
QARIATL.YLP 
TSTPNINSVR 
UTYWNKAST 
NHSYGHSDAD 
LRSLIYKFPS 
VGIGETRFQQ 
TVOJDSMARIH 
EQCADGLWKA 
VHPKDIjDSKY 
RIPVMYQHHT 
KLLKEVFRQF 
STMVHGMTSS 
| 70 



| 80 
YKTVHEELTK 80 
TQKFGDNPEA 160 
KGRIQRYQDL 24 0 
UGLLIENVQ 320 
KADSRGSL1 S 400 
SELMDFFTIS 4 80 
VLHQSKLEAN 560 
TFYEGRADMC 64 0 
SLSIINNCAK 720 
VKNGDLSEAA 800 
ERYELIADIY 880 
AYIQVTHVIP 960 
DLOTIEVAID 1040 
VEACGQA1AV 1120 
SSW 1194 
| 80 



1 



1 



Exon 1A (-182 to -102) 

GCAGGGGAAAAACCTGGCCCCATGATTCACTTACTTCCCACCGGATCTCTCCCATGACACGTGAGGATTA 
TTACAATTTAA -102 

Exon IB (-219 to -102) 

TTATCCCTTTACTACTTGCGAAGTGAGTTCGGTAGATGGGAGTGGAGAAGAGAACCTTAGAATCATTGTTTAGTCTTCAT 
CTTTCACAGCTCAGGCTGAAGGCCTTTCCTTGCTGAGA - 1 0 2 

Exon 1C (-143 to -102) 

GCGGCAGAGCGTGTCTGAGGTGGTGCGCGGCTCCGTGCTCCT - 1 0 2 



Exon2 and the rest of human CLASP 2 cDNA 

-101 -79 
GGCAAAGCCAAAGCTAATTGAGC 

-78 -1 
AAGCTAATTGAGCCACTCGACTATGAAAATGTCATCGTCCAGAAGAAGACTCAGATCCTGAACGACTGTTTACGGGAG 

_ 1/1 31/11 

ATG CTG CTC TTC CCT TAC GAT GAG TTT CAG ACG GCC ATC CTG AGA CGA CAG GGT CGA TAC 
Ml? Met leu leu phe pro tyr asp asp phe gin thr ala ile leu arg arg gin gly arg tyr 

S3 61/21 91/31 

SJATA TGC TCA ACA GTG CCT GCG AAG GCG GAA GAG GAA GCA CAG AGC TTG TTT GTT ACA GAG 
fn ile cys ser thr val pro ala lys ala glu glu glu ala gin ser leu phe val thr glu 

121/41 151/51 
J* TGC ATC AAA ACC TAT AAC TCT GAC TGG CAT CTT GTG AAC TAT AAA TAT GAA GAT TAC TCA 
7 s cys ile lys thr tyr asn ser asp trp his leu val asn tyr lys tyr glu asp tyr ser 

p-181/61 211/71 

H= GGA GAG TTT CGA CAG CTT CCG AAC AAA GTG GTC AAG TTG GAT AAA CTT CCA GTT CAT GTC 
W gly glu phe arg gin leu pro asn lys val val lys leu asp lys leu pro val his val 

O 241/81 271/91 

™" TAT GAA GTT GAC GAG GAG GTC GAC AAA GAT GAG GAT GCT GCC TCC CTT GGT TCC CAG AAG 
tyr glu val asp glu glu val asp lys asp glu asp ala ala ser leu gly ser gin lys 

301/101 331/111 

GGT GGG ATC ACC AAG CAT GGC TGG CTG TAC AAA GGC AAC ATG AAC AGT GCC ATC AGC GTG 
gly gly ile thr lys his gly trp leu tyr lys gly asn met asn ser ala ile ser val 

361/121 391/131 

ACC ATG AGG TCA TTT AAG AGA CGA TTT TTC CAC CTG ATT CAA CTT GGC GAT GGA TCC TAT 
thr met arg ser phe lys arg arg phe phe his leu ile gin leu gly asp gly ser tyr 

421/141 451/151 

AAT TTG AAT TTT TAT AAA GAT GAA AAG ATC TCC AAA GAA CCA AAA GGA TCA ATA TTT CTG 
asn leu asn phe tyr lys asp glu lys ile ser lys glu pro lys gly ser ile phe leu 

481/161 511/171 

GAT TCC TGT ATG GGT GTC GTT CAG AAC AAC AAA GTC AGG CGT TTT GCT TTT GAG CTC AAG 
asp ser cys met gly val val gin asn asn lys val arg arg phe ala phe glu leu lys 
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541/181 571/191 

ATG CAG GAC AAA AGT AGT TAT CTC TTG GCA GCA GAC AGT GAA GTG GAA ATG GAA GAA TGG 
met gin asp lys ser ser tyr leu leu ala ala asp ser glu val glu met glu glu trp 

601/201 631/211 

ATC ACA ATT CTA AAT AAG ATC CTC CAG CTC AAC TTT GAA GCT GCA ATG CAA GAA AAG CGA 

ile thr ile leu asn lys ile leu gin leu asn phe glu ala ala met gin glu lys arg 

661/221 691/231 

AAT GGC GAC TCT CAC GAA GAT GAT GAA CAA AGC AAA TTG GAA GGT TCT GGT TCC GGT TTA 

asn gly asp ser his glu asp asp glu gin ser lys leu glu gly ser gly ser gly leu 

721/241 751/251 

GAT AGC TAC CTG CCG GAA CTT GCC AAG AGT GCA AGA GAA GCA GAA ATC AAA CTA AAA AGT 

asp ser tyr leu pro glu leu ala lys ser ala arg glu ala glu ile lys leu lys ser 

781/261 811/271 

GAA AGC AGA GTC AAA CTT TTT TAT TTG GAC CCA GAT GCC CAG AAG CTT GAC TTC TCA TCA 

glu ser arg val lys leu phe tyr leu asp pro asp ala gin lys leu asp phe ser ser 

841/281 871/291 

: GCT GAG CCA GAA GTG AAG TCA TTT GAA GAG AAG TTT GGA AAA AGG ATC CTT GTC AAG TGC 

'ala glu pro glu val lys ser phe glu glu lys phe gly lys arg ile leu val lys cys 

j 901/301 931/311 

lAAT GAT TTA TCT TTC AAT TTG CAA TGC TGT GTT GCC GAA AAT GAA GAA GGA CCC ACT ACA 

lasn asp leu ser phe asn leu gin cys cys val ala glu asn glu glu gly pro thr thr 

961/321 991/331 

AAT GTT GAA CCT TTC TTT GTT ACT CTA TCC CTG TTT GAC ATA AAA TAC AAC CGG AAG ATT 

asn val glu pro phe phe val thr leu ser leu phe asp ile lys tyr asn arg lys ile 

1021/341 1051/351 

'TCT GCC GAT TTC CAC GTA GAC CTG AAC CAT TTC TCA GTG AGG CAA ATG CTC GCC ACC ACG 

'ser ala asp phe his val asp leu asn his phe ser val arg gin met leu ala thr thr 

11081/361 1111/371 

^TCC CCG GCG CTG ATG AAT GGC AGT GGG CAG AGC CCA TCT GTC CTC AAG GGC ATC CTT CAT 

ser pro ala leu met asn gly ser gly gin ser pro ser val leu lys gly ile leu his 

1141/381 1171/391 

GAA GCC GCC ATG CAG TAT CCG AAG CAG GGA ATA TTT TCA GTC ACT TGT CCT CAT CCA GAT 

glu ala ala met gin tyr pro lys gin gly ile phe ser val thr cys pro his pro asp 

1201/401 1231/411 

ATA TTT CTT GTG GCC AGA ATT GAA AAA GTC CTT CAG GGG AGC ATC ACA CAT TGC GCT GAG 

ile phe leu val ala arg ile glu lys val leu gin gly ser ile thr his cys ala glu 

1261/421 1291/431 

CCA TAT ATG AAA AGT TCA GAC TCT TCT AAG GTG GCC CAG AAG GTG CTG AAG AAT GCC AAG 

pro tyr met lys ser ser asp ser ser lys val ala gin lys val leu lys asn ala lys 

1321/441 1351/451 

CAG GCA TGC CAA AGA CTA GGA CAG TAT AGA ATG CCA TTT GCT TGG GCA GCA AGG ACA TTG 

gin ala cys gin arg leu gly gin tyr arg met pro phe ala trp ala ala arg thr leu 
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1381/461 

TTT AAG GAT GCA TCT GGA AAT CTT GAC AAA 
phe lys asp ala ser gly asn leu asp lys 

1441/481 

GAC AGC AAT AAG CTA TCC AAT GAT GAC ATG 
asp ser asn lys leu ser asn asp asp met 

1501/501 

GAG AAG ATG GCT AAG CTC CCA GTG ATT TTA 
glu lys met ala lys leu pro val ile leu 

1561/521 

TCC TCA GAC TTC CCT AAT TAT GTT AAT TCA 
ser ser asp phe pro asn tyr val asn ser 

1621/541 

TGC AGT AAA ACT CCC ATC ACG TTT GAA GTG 
cys ser lys thr pro ile thr phe glu val 

1681/561 

ACT CAG CCT TAC ACC ATC TAC ACC AAT CAC 
thr gin pro tyr thr ile tyr thr asn his 

1741/581 

GAC AGT CAG AAG TCT TTT GCC AAG GCT AGA 
asp ser gin lys ser phe ala lys ala arg 

1801/601 

TCA GAT GAG GAA GAC TCT CAG CCC CTT AAG 
ser asp glu glu asp ser gin pro leu lys 

1861/621 

TTC ACA AGA AGC GCC TTT GCT GCA GTT TTA 
phe thr arg ser ala phe ala ala val leu 

1921/641 

GAG ATT AAA ATA GAG TTG CCC ACT CAG CTG 
glu ile lys ile glu leu pro thr gin leu 

1981/661 

TTC CAT GTC AGC TGT GAC AAC TCA AGT AAA 
phe his val ser cys asp asn ser ser lys 

2041/681 

ACC CAA GTT GGC TAC TCC TGG CTT CCC CTC 
thr gin val gly tyr ser trp leu pro leu 

2101/701 

CAG CAC ATC CCG GTC TCG GCG TAC CTT CCT 
gin his ile pro val ser ala tyr leu pro 

2161/721 

ATG GGC AGG CAT TAT GGT CCG GAA ATT AAA 
met gly arg his tyr gly pro glu ile lys 



1411/471 

AAT GCC AGA TTT TCT GCC ATC TAC AGG CAA 
asn ala arg phe ser ala ile tyr arg gin 

1471/491 

CTC AAG TTA CTT GCA GAC TTT CGG AAA CCT 
leu lys leu leu ala asp phe arg lys pro 

1531/511 

GGC AAT CTA GAC ATT ACA ATT GAT AAT GTT 
gly asn leu asp ile thr ile asp asn val 

1591/531 

TCA TAC ATT CCC ACA AAA CAA TTT GAA ACC 
ser tyr ile pro thr lys gin phe glu thr 

1651/551 

GAG GAA TTT GTG CCC TGC ATA CCA AAA CAC 
glu glu phe val pro cys ile pro lys his 

1711/571 

CTT TAC GTT TAT CCT AAG TAC TTG AAA TAC 
leu tyr val tyr pro lys tyr leu lys tyr 

1771/591 

AAT ATT GCG ATT TGC ATT GAA TTC AAA GAT 
asn ile ala ile cys ile glu phe lys asp 

1831/611 

TGC ATT TAT GGC AGA CCT GGT GGG CCA GTT 
cys ile tyr gly arg pro gly gly pro val 

1891/631 

CAC CAT CAC CAA AAC CCA GAA TTT TAT GAT 
his his his gin asn pro glu phe tyr asp 

1951/651 

CAT GAA AAG CAC CAC CTG TTG CTC ACA TTC 
his glu lys his his leu leu leu thr phe 

2011/671 

GGA AGC ACG AAG AAG AGG GAT GTC GTT GAA 
gly ser thr lys lys arg asp val val glu 

2071/691 

CTG AAA GAC GGA AGG GTG GTG ACA AGC GAG 
leu lys asp gly arg val val thr ser glu 

2131/711 

TCG GGC CAT CTT GGC TAC CAA GAG CTT GGG 
ser gly his leu gly tyr gin glu leu gly 

2191/731 

TGG GTA GAT GGA GGC AAG CCA CTG CTG AAA 
trp val asp gly gly lys pro leu leu lys 
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2221/741 

ATT TCC ACT CAT CTG GTT TCT ACA GTG TAT 
ile ser thr his leu val ser thr val tyr 

2281/761 

CAG TAC TGT CAG AAA ACC GAA TCT GGA GCC 
gin tyr cys gin lys thr glu ser gly ala 

2341/781 

CTT AAG AGT CTG CAT GCG ATG GAA GGC CAC 
leu lys ser leu his ala met glu gly his 

2401/801 

AAC CAG CTG TTC CGA GTC CTC ACC AGA GCC 
asn gin leu phe arg val leu thr arg ala 

2461/821 

CGG GTC ATT ATT CAT GTG GTT GCC CAG TGC 
arg val ile ile his val val ala gin cys 

2521/841 

^TCA TAT GTT AAG TAC GCG TAT AAG GCT GAG 
^zser tyr val lys tyr ala tyr lys ala glu 

22581/861 

^CAT GAA GAA CTG ACC AAA TCC ATG ACC ACG 
^Jhis glu glu leu thr lys ser met thr thr 

102641/881 

\JAGC AAC AAA CTA CTG AGG TAC TCA TGG TTT 
5 ser asn lys leu leu arg tyr ser trp phe 

U2701/901 

£TCAG CAT TTG ATA GAG AAC TCC AAA GTT AAG 
f"~gln his leu ile glu asn ser lys val lys 

^2761/921 

CJTAT CAT CAT GCA GCG GAA ACC GTT GTA AAT 
tyr his his ala ala glu thr val val asn 

2821/941 

GGA GAT AAT CCA GAG GCA TCT AAG AAC GCG 
gly asp asn pro glu ala ser lys asn ala 

2881/961 

TGT TTC ACC TTC ATG GAC AGG GGC TTT GTC 
cys phe thr phe met asp arg gly phe val 

2941/981 

TTT GCT CCT GGA GAC CCA AAG ACC CTC TTT 
phe ala pro gly asp pro lys thr leu phe 

3001/1001 

TGC AAC CAT GAA CAT TAT ATT CCG TTG AAC 
cys asn his glu his tyr ile pro leu asn 



2251/751 

ACT CAG GAT CAG CAT TTA CAT AAT TTT TTC 
thr gin asp gin his leu his asn phe phe 

2311/771 

CAA GCC TTA GGA AAC GAA CTT GTA AAG TAC 
gin ala leu gly asn glu leu val lys tyr 

2371/791 

GTG ATG ATC GCC TTC TTG CCC ACT ATC CTA 
val met ile ala phe leu pro thr ile leu 

2431/811 

ACA CAG GAA GAA GTC GCG GTT AAC GTG ACT 
thr gin glu glu val ala val asn val thr 

2491/831 

CAT GAG GAA GGA TTG GAG AGC CAC TTG AGG 
his glu glu gly leu glu ser his leu arg 

2551/851 

CCA TAT GTT GCC TCT GAA TAC AAG ACA GTG 
pro tyr val ala ser glu tyr lys thr val 

2611/871 

ATT CTC AAG CCT TCT GCC GAT TTC CTC ACC 
ile leu lys pro ser ala asp phe leu thr 

2671/891 

TTC TTT GAT GTA CTG ATC AAA TCT ATG GCT 
phe phe asp val leu ile lys ser met ala 

2731/911 

TTG CTG CGA AAC CAG AGA TTT CCT GCA TCC 
leu leu arg asn gin arg phe pro ala ser 

2791/931 

ATG CTG ATG CCA CAC ATC ACT CAG AAG TTT 
met leu met pro his ile thr gin lys phe 

2851/951 

AAT CAT AGC CTT GCT GTC TTC ATC AAG AGA 
asn his ser leu ala val phe ile lys arg 

2911/971 

TTC AAG CAG ATC AAC AAC TAC ATT AGC TGT 
phe lys gin ile asn asn tyr ile ser cys 

2971/991 

GAA TAC AAG .TTT GAA TTT CTC CGT GTA GTG 
glu tyr lys phe glu phe leu arg val val 

3031/1011 

TTA CCA ATG CCA TTT GGA AAA GGC AGG ATT 
leu pro met pro phe gly lys gly arg ile 
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3061/1021 3091/1031 

CAA AGA TAC CAA GAC CTC GAG CTT GAC TAG TCA TTA ACA GAT GAG TTC TGC AGA AAC CAC* 
gin arg tyr gin asp leu gin leu asp tyr ser leu thr asp glu phe cys arg asn his 



3121/1041 

TTC TTG GTG GGA CTG TTA CTG AGG GAG GTG 
phe leu val gly leu leu leu arg glu val 

3181/1061 

CGT CTG ATC GCC ATC AGT GTG CTC AAG AAC 
arg leu ile ala ile ser val leu lys asn 

3241/1081 

TAT GCT TCA AGG AGC CAT CAG GCA AGG ATA 
tyr ala ser arg ser his gin ala arg ile 

3301/1101 

CTG ATT GAA AAC GTC CAG CGG ATC AAT GTG 
leu ile glu asn val gin arg ile asn val 

3361/1121 

. GGC ATG ACC GTG AAG GAT GAA TCC CTG GCT 
fgly met thr val lys asp glu ser leu ala 

^3421/1141 

J CAG AAG GGA AGC ACC CTG GAC AAC AGC CTG 
J gin lys gly ser thr leu asp asn ser leu 

J3481/1161 

j ATT GCT TCT CCA TAT ACA ACC TCA ACT CCA 
ile ala ser pro tyr thr thr ser thr pro 

3541/1181 

^AGA GGA TCT CTC ATA AGC ACA GAT TCG GGT 
arg gly ser leu ile ser thr asp ser gly 

P3601/1201 

5 AGC AAT TCC CTG GAT AAG CAC CAA CAA AGT 
ser asn ser leu asp lys his gin gin ser 

3661/1221 

GAT AAA CTT GAC CAG TCT GAG ATT AAG AGC 
asp lys leu asp gin ser glu ile lys ser 

3721/1241 

AGC ATG TCT GAT GAT GCT TTG TTT • ACA TAT 
ser met ser asp asp ala leu phe thr tyr 

3781/1261 

GAT TTT TTT ACA ATA TCT GAA GTC TGC CTG 
asp phe phe thr ile ser glu val cys leu 

3841/1281 

ATA GCC AGG AAC CAG GAG GGG TTG GGA CCC 
ile ala arg asn gin glu gly leu gly pro 



3151/1051 

GGG ACA GCC CTC CAG GAG TTC CGG GAG GTC 
gly thr ala leu gin glu phe arg glu val 

3211/1071 

CTG CTG ATA AAG CAT TCT TTT GAT GAC AGA 
leu leu ile lys his ser phe asp asp arg 

3271/1091 

GCC ACC CTC TAC CTG CCT CTG TTT GGT CTG 
ala thr leu tyr leu pro leu phe gly leu 

3331/1111 

AGG GAT GTG TCA CCC TTC CCT GTG AAC GCG 
arg asp val ser pro phe pro val asn ala 

3391/1131 

CTA CCA GCT GTG AAT CCG CTG GTG ACG CCG 
leu pro ala val asn pro leu val thr pro 

3451/1151 

CAC AAG GAC CTG CTG GGC GCC ATC TCC GGC 
his lys asp leu leu gly ala ile ser gly 

3511/1171 

AAC ATC AAC AGT GTG AGA AAT GCT GAT TCG 
asn ile asn ser val arg asn ala asp ser 

3571/1191 

AAC AGC CTT CCA GAA AGG AAT AGT GAG AAG 
asn ser leu pro glu arg asn ser glu lys 

3631/1211 

AGC ACA TTG GGA AAT TCC GTG GTT CGC TGT 
ser thr leu gly asn ser val val arg cys 

3691/1231 

CTA CTG ATG TGT TTC CTC TAC ATC TTA AAG 
leu leu met cys phe leu tyr ile leu lys 

3751/1251 

TGG AAC AAG GCT TCA ACA TCT GAA CTT ATG 
trp asn lys ala ser thr ser glu leu met 

3811/1271 

CAC CAG TTC CAG TAC ATG GGG AAG CGA TAC 
his gin phe gin tyr met gly lys arg tyr 

3871/1291 

ATA GTT CAT GAT CGA AAG TCT CAG ACA TTG 
ile val his asp arg lys ser gin thr leu 
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3901/1301 

CCT GTT TCC CGT AAC AGA ACA GGA ATG ATG 
pro val ser arg asn arg thr gly met met 

3961/1321 

GAT AAC TCT CTC ACT TTT AAC CAC AGC TAT 
asp asn ser leu thr phe asn his ser tyr 

4021/1341 

TCA TTA CTT GAA GCC AAC ATT GCT ACT GAG 
ser leu leu glu ala asn ile ala thr glu 

4081/1361 

CTA TTT ACA TTG GCG TTT AAG AAC CAG CTC 
leu phe thr leu ala phe lys asn gin leu 

4141/1381 

AAA AAA GTT TTT GAT GTC TAC CTG TGT TTT 
lys lys val phe asp val tyr leu cys phe 

4201/1401 

AAA AAT GTC TTC ACT GCC TTA AGG TCC TTA 
lys asn val phe thr ala leu arg ser leu 

4261/1421 

GGG AGA GCG GAC ATG TGT GCG GCT CTG TGT 
gly arg ala asp met cys ala ala leu cys 

4321/1441 

CTG AGC TCC ATC AGG ACG GAG GCC TCC CAG 
leu ser ser ile arg thr glu ala ser gin 

4381/1461 

GAT TAC ACT GGA AAG AAG TCC TTT GTC CGG 
asp tyr thr gly lys lys ser phe val arg 

4441/1481 

CAG CTG ATA GCA GAC GTT GTT GGC ATT GGG 
gin leu ile ala asp val val gly ile gly 

4501/1501 

ATC AAC AAC TGT GCC AAC AGT GAC CGG CTT 
ile asn asn cys ala asn ser asp arg leu 

4561/1521 

AAG GAC TTA ACC AAA AGG ATA CGC ACG GTG 
lys asp leu thr lys arg ile arg thr val 

4621/1541 

GAG AAC GAC CCA GAG ATG CTG GTG GAC CTC 
glu asn asp pro glu met leu val asp leu 

4681/1561 

ACG CCC GAG CTC AGG AAG ACG TGG CTC GAC 
thr pro glu leu arg lys thr trp leu asp 



3931/1311 

CAT GCC AGA TTG CAG CAG CTG GGC AGC CTG 
his ala arg leu gin gin leu gly ser leu 

3991/1331 

GGC CAC TCG GAC GCA GAT GTT CTG CAC CAG 
gly his ser asp ala asp val leu his gin 

4051/1351 

GTT TGC CTG ACA GCT CTG GAC ACG CTT TCT 
val cys leu thr ala leu asp thr leu ser 

4111/1371 

CTG GCC GAC CAT GGA CAT AAT CCT CTC ATG 
leu ala asp his gly his asn pro leu met 

4171/1391 

CTT CAA AAA CAT CAG TCT GAA ACG GCT TTA 
leu gin lys his gin ser glu thr ala leu 

4231/1411 

ATT TAT AAG TTT CCC TCA ACA TTC TAT GAA 
ile tyr lys phe pro ser thr phe tyr glu 

4291/1431 

TAC GAG ATT CTC AAG TGC TGT AAC TCC AAG 
tyr glu ile leu lys cys cys asn ser lys 

4351/1451 

CTG CTC TAC TTC CTG ATG AGG AAC AAC TTT 
leu leu tyr phe leu met arg asn asn phe 

4411/1471 

ACA CAT TTG CAA GTC ATC ATA TCT GTC AGC 
thr his leu gin val ile ile ser val ser 

4471/1491 

GAA ACC AGA TTC CAG CAG TCC CTG TCC ATC 
glu thr arg phe gin gin ser leu ser ile 

4531/1511 

ATT AAG CAC ACC AGC TTC TCC TCT GAT GTG 
ile lys his thr ser phe ser ser asp val 

4591/1531 

CTA ATG GCC ACC GCC CAG ATG AAG GAG CAT 
leu met ala thr ala gin met lys glu his 

4651/1551 

CAG TAC AGC CTG GCC AAA TCC TAT GCC AGC 
gin tyr ser leu ala lys ser tyr ala ser 

4711/1571 

AGC ATG GCC AGG ATC CAT .GTC AAA AAT GGC 
ser met ala arg ile his val lys asn gly 
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4741/1581 

GAT CTC TCA GAG GCA GCA ATG TGC TAT GTC 
asp leu ser glu ala ala met cys tyr val 

4801/1601 

ACA CGG AAA GGC GTG TTT AGA CAA GGA TGC 
thr arg lys gly val phe arg gin gly cys 

4861/1621 

GAC GAG GAG GCC TCC ATG ATG GAA GAC GTG 
asp glu glu ala ser met met glu asp val 

4921/1641 

GTG CTG ATG GAG CTC CTT GAG CAG TGC GCA 
val leu met glu leu leu glu gin cys ala 

4981/1661 

CTC ATC GCC GAC ATC TAC AAA CTT ATC ATC 
leu ile ala asp ile tyr lys leu ile ile 

5041/1681 

i GAA GAT GAA GAT GGA AAG GAG TAT ATT TAC 
glu asp glu asp gly lys glu tyr ile tyr 

= 5101/1701 

[ATT TCT CAG AGA CTC CTT AAA CTG TAC TCG 
* ile ser gin arg leu leu lys leu tyr ser 

i 5161/1721 

| ATA CAG GAT TCT GGC AAG GTC AAC CCT AAG 
ile gin asp ser gly lys val asn pro lys 

i 5221/1741 

GTG ACT CAC GTC ATC CCC TTC TTT GAC GAA 
s val thr his val ile pro phe phe asp glu 

^5281/1761 

J GAG AGA TCC CAC AAC ATC CGC CGC TTC ATG 
glu arg ser his asn ile arg arg phe met 

5341/1781 

AGG CAG GGC GGG GTG GAA GAG CAG TGC AAA 
arg gin gly gly val glu glu gin cys lys 

5401/1801 

TTC CCT TAT GTG AAG AAG CGC ATC CCT GTC 
phe pro tyr val lys lys arg ile pro val 

5461/1821 

ATC GAG GTG GCC ATT GAC GAG ATG AGT AAG 
ile glu val ala ile asp glu met ser lys 

5521/1841 

TCG GCC GAG GTG GAC ATG ATC AAA CTG CAG 
ser ala glu val asp met ile lys leu gin 



4771/1591. 

CAC GTA ACA GCC CTA GTG GCA GAA TAT CTC 
his val thr ala leu val ala glu tyr leu 

4831/1611 

ACC GCC TTC AGG GTC ATT ACC CCA AAC ATC 
thr ala phe arg val ile thr pro asn ile 

4891/1631 

GGG ATG CAG GAT GTC CAT TTC AAC GAG GAT 
gly met gin asp val his phe asn glu asp 

4951/1651 

GAT GGA CTC TGG AAA GCC GAG CGC TAC GAG 
asp gly leu trp lys ala glu arg tyr glu 

5011/1671 

CCC ATT TAT GAG AAG CGG AGG GAT TTC TTT 
pro ile tyr glu lys arg arg asp phe phe 

5071/1691 

AAG GAA CCC AAA CTC ACA CCG CTG TCG GAA 
lys glu pro lys leu thr pro leu ser glu 

5131/1711 

GAT AAA TTT GGT TCT GAA AAT GTC AAA ATG 
asp lys phe gly ser glu asn val lys met 

5191/1731 

GAT CTG GAT TCT AAG TAT GCA TAC ATC CAG 
asp leu asp ser lys tyr ala tyr ile gin 

5251/1751 

AAA GAG TTG CAA GAA AGG AAA ACA GAG TTT 
lys glu leu gin glu arg lys thr glu phe 

5311/1771 

TTT GAG ATG CCA TTT ACG CAG ACC GGG AAG 
phe glu met pro phe thr gin thr gly lys 

5371/1791 

CGG CGC ACC ATC CTG ACA GCC ATA CAC TGC 
arg arg thr ile leu thr ala ile his cys 

5431/1811 

ATG TAC CAG CAC CAC ACT GAC CTG AAC CCC 
met tyr gin his his thr asp leu asn pro 

5491/1831 

AAG GTG GCG GAG CTC CGG CAG CTG TGC TCC 
lys val ala glu leu arg gin leu cys ser 

5551/1851 

CTC AAA CTC CAG GGC AGC GTG AGT GTT CAG 
leu lys leu gin gly ser val ser val gin 
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5581/1861 

GTC AAT GCT GGC CCA CTA GCA TAT GCG CGA 
val asn ala gly pro leu ala tyr ala arg 

5641/1881 

TAT CCT GAC AAT AAA GTG AAG CTG CTT AAG 
tyr pro asp asn lys val lys leu leu lys 

5701/1901 

GGT CAA GCC TTA GCG GTA AAC GAA CGT CTG 
gly gin ala leu ala val asn glu arg leu 

5761/1921 

GAA ATG AAA GCC AAC TAC AGG GAA ATG GCG 
glu met lys ala asn tyr arg glu met ala 

5821/1941 

ATC TGC CCC CTG GAG GAG AAG ACG AGC GTC 
ile cys pro leu glu glu lys thr ser val 

5881/1961 

™ ATC AGT GGG ACT CCA ACA AGC ACA ATG GTT 
^ ile ser gly thr pro thr ser thr met val 

2 5941/1981 

PJ TGA TTA CAT CTC ATG GCC CGT GTG TGG GGA 

S| ,,_'A 

LJ 6001 

CjTTT CCA AAG CCA ATC ACT GGG GAG ACC GAG 
a " 6061 

^AAG GAA ATA AAG AAC AAC GTT ATT TCT TAA 
S 6121 

HCAC ATA TTT TTT TAA ATC TCA CTG GCA ATA 
ft 6181 

UJtGT GGT AGA CAC TCT TGA GCT GGA CTT AGA 
P6241 

OaTA GAT GGC CTA CAG AAA AAA AAG GTT CTG 
6301 

CAT TGA TGC CTG GGG GAC CTT TTG CCT CGA 
6361 

TAC AGA ACT TAC TAG TTT TGT CTA GGA GTA 
6421 

TCA TTC AAC AAC ATA GAG CAA GAA TAG TGA 
6481 

CTA CTG GCT TCA AGT CAG AAC TTT GTC ATT 
6541 

ATT ACA TTT CTA CAT TTT TAA TAC TCA CAT 
6601 

ATT TGT GCT GGT CCA GTA TAT GCA ATA CAC 
6661 

GCA ATA TGG AGA TGT ATA CAA GTC TTT ACT 



5611/1871 

GCT TTC TTA GAT GAT ACA AAC ACA AAG CGA 
ala phe leu asp asp thr asn thr lys arg 

5671/1891 

GAA GTT TTC AGG CAA TTT GTG GAA GCT TGC 
glu val phe arg gin phe val glu ala cys 

5731/1911 

ATT AAA GAA GAC CAG CTC GAG TAT CAG GAA 
ile lys glu asp gin leu glu tyr gin glu 

5791/1931 

AAG GAG CTT TCT GAA ATC ATG CAT GAG CAG 
lys glu leu ser glu ile met his glu gin 

5851/1951 

TTA CCG AAT TCC CTT CAC ATC TTC AAC GCC 
leu pro asn ser leu his ile phe asn ala 

5911/1971 

CAC GGG ATG ACC AGC TCG TCT TCG GTC GTG 
his gly met thr ser ser ser ser val val 

5971 

CTT GCT TTG TCA TTT GCA AAC TCA GGA TGC 



6031 

CAC AGG GAG GAC CAA GGG GAA GGG GAG AGA 
6091 

CAG ACT TTC TAT AGG AGT TGT AAG AAG GTG 
6151 

TTC AAA GTT TTC ATT GTG TCT TAA CAA AGG 
6211 

TTT TAT TCT TCC TTG CAG AGT AGT GTT AGA 
6271 

GGA TCT ACA TGG CAG GGA GGG CTG CAC TGA 
6331 

CTC GTG CCG GAA ATC TGA TCG TAA TCA GGG 
6391 

TGT TGT ATG ACT AGG ATT TGT GCT ATT ATC 
6451 

GCT AAC TGA GCT AGA CAC TCA ATT AAT CCG 
6511 

AAT CAT CGA CTC CGG GAC GGT CAT ATA TGT 
6571 

GGG CTT ATG CAT TAA GTT TAA TTG TGA TAA 
6631 

TTT AAT GGT TTA TTC TTG -TCA TAA AAA TGT 
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A. Allelic variations: single nucleotide changes (polymorphism) between CLASP-2 cDNA 
isoforms 



Isoform Difference Nucleotide(s) Consequence 

1 polymorphism 862 A to G change; mis-sense 

mutation 

2 polymorphism A to C change; mis-sense 

mutation changing codon from 
histidine to proline 

3 polymorphism 22 1 0 A to G change; mis-sense 

mutation changing codon from 
aspargine to glutamic acid 

4 polymorphism 2225 C to T change; mis-sense 

mutation changing codon from 
histidine to tyrosine 



B. Alternative splices 



Isoform Difference Nucleotide(s) Consequence 

1 exon deletion 209-291 premature, in- frame stop codon 



leading to the production of a 
truncated, most likely soluble 
protein 



These differences may be found separately or together in various combinations in the different 
human CLASP-2 isoforms 
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1st exon (nucleotides 335 to 445) 

TGTCTTGCTTATCTTTTCGCCCTCCAGGCAAAG CCAAAGCTAATTGAGCCACT 
CGACTATGAAAATGTCATCGTCCAGAAGAAGACTCAGATCCTGAACGACTGT 
TTACGGGAGATGCTGCTCTTCCCTTACGATGACTTTCAG GTAAGTAACGTTAT 
GTTTCTATCCGTAGAACCACG 

2nd exon (nucleotides 7101-7190) 

TTACCCAAGGCTTTTCCTCCTGTTTTTGTTTCCA GACGGCCATCCTGAGACGA 

CAGGGTCGATACATATGCTCAACAGTGCCTGCGAAGGCGGAAGAGGAAGCA 
C AGAGCTTGTTTGTTAC AGAG GTAAGGCTCTTTrrTGr ATT A ATTTAP A TTTT 

GAAGTCATTTTCCCCTAACTGCCTCC 
3rd exon (nucleotides 1 1439 to 1 1521) 

TTTTCTATTTTTAAAATCCCCCTTCAATAG TGCATCAAAACCTATAACTCTGAC 

TGGCATCTTGTGAACTATAAATATGAAGATTACTCAGGAGAGTTTCGACAGC 

TTCjCGAAGTGAGTAAGCTATATTATACACATAGGGAAAAGTCTTT 

4th exon (nucleotides 13987 to 14056) 

CTAAAACAAATTTTCTTTGTTGTTTTTATAG CAAAGTGGTCAAGTTGGATAAA 
CTTCCAGTTCATGTCTATGAAGTTGACGAGGAGGTCGACAAAGATGAG GTGG 
GATACCTGCTTGCTGTTGCTTCTCTTTTCACTCTAGATTTAA 

5th exon (nucleotides 15212 to 15307) 

GGAGGTTGACTGCTGGTGTTTTCCTTCTCTCCTAG GATGCTGCCTCCCTTGGTT 
CCCAGAAGGGTGGGATCACCAAGCATGGCTGGCTGTACAAAGGCAACATGA 
AC AGTGCC ATC AGCGTG ACC ATG AGGG TG A GGACCiC. AC. A TC. A CTTTGCCCTC 

CCCTCTCACAAGCCCTTTC 

6th exon (nucleotides 16269 to 16404) 

TGAAAGAATAGCTGTGTGTATATTTTTCTCTCA GTCATTTAAGAGACGATTTT 

TCCACCTGATTCAACTTGGCGATGGATCCTATAATTTGAATTTTTATAAAGAT 

GAAAAGATCTCCAAAGAACCAAAAGGATCAATATTTCTGGATTCCTGTATGG 
GTGTCGTTC AG GTAA AT ATGA A A A G A GTTTT ACC. A TT A TOTTTTPTT A 

7th exon (nucleotides 19459 to 19633) 

AAGTATGTCTGTTTATCCTTTTTTCATTTCA GAACAACAAAGTCAGGCGTTTT 

GCTTTTGAGCTCAAGATGCAGGACAAAAGTAGTTATCTCTTGGCAGCAGACA 

GTGAAGTGGAAATGGAAGAATGGATCACAATTCTAAATAAGATCCTCCAGCT 

CAACTTTGAAGCTGCAATGCAAGAAAAGCGAAATGGCGACTCTCACGAAG GT 

AGATAGGCTTGGCTTCCCCCAGGCACATACACACTCT 

8th exon (nucleotides 20567 to 20634) 
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ATTACAAGTGATTCCGATAATCTGTTTTGCCATTTTAG ATGATGAACAAAGCA 
AATTGGAAGGTTCTGGTTCCGGTTTAGATAGCTACCTGCCGGAACTTGCCAAG 
GTAACATCGTCTTATATCTTCTGCTCTTCGTTGAATGC 

9th exon (nucleotides 30257 to 30331) 

GATTGTGTTAAATGTAATTTTCATGTATCTTGTTATCAG AGTGCAAGAGAAGC 

AGAAATCAAACTAAAAAGTGAAAGCAGAGTCAAACTTTTTTATTTGGACCCA 
GATGCCCAG GTAAGAACTATCTAAATGTTTA ATATTTA A A AfPA A AT 

1 Oth exon (nucleotides 3 1 85 1 to 3 1 99 1 ) 

CATAACTTATTTATATGTTTACATTTTCTTTTAAA GAAGCTTGACTTCTCATCA 

GCTGAGCCAGAAGTGAAGTCATTTGAAGAGAAGTTTGGAAAAAGGATCCTTG 

TCAAGTGCAATGATTTATCTTTCAATTTGCAATGCTGTGTTGCCGAAAATGAA 
GAAGGACCCACTACAAAT GTA ATTTTTrATTTTA A A A ATA A APATTA A A A A A 

AAAATAGGCAG 

1 1th exon (nucleotides 32472 to 32675) 

CCATGGTGATCATTGGATTGTTTTGTTTTGTTCA GGTTGAACCTTTCTTTGTTA 

CTCTATCCCTGTTTGACATAAAATACAACCGGAAGATTTCTGCCGATTTCCAC 

GTAGACCTGAACCATTTCTCAGTGAGGCAAATGCTCGCCACCACGTCCCCGG 

CGCTGATGAATGGCAGTGGGCAGAGCCCATCTGTCCTCAAGGGCATCCTTCA 

TGAAGCCGCCATGCAGTATCCGAAGCAG GTGGGGAGTATGAGCCCAGr.ATTr 

CCACTACTCAGACTCACTTTGCATGC 

12th exon (nucleotides 33063 to 33185) 

GAATTCTGCTTACTGAAGAAAATTGTTTGCCTCCTAG GGAATATTTTCAGTCA 
CTTGTCCTCATCCAGATATATTTCTTGTGGCCAGAATTGAAAAAGTCCTTCAG 
GGGAGCATCACACATTGCGCTGAGCCATATATGAAAAGTTCAGACTCTTCTA 
AGGTATGAATGGCTTTTACGCTTTGGGGTGGTAAAAAGCAATCTGAA 

13th exon (nucleotides 36702 to 36784) 

CAGTATCTCATAGCTTTATTCTCATGTCTTCAAG GTGGCCCAGAAGGTGCTGA 

AGAATGCCAAGCAGGCATGCCAAAGACTAGGACAGTATAGAATGCCATTTGC 
TTGGGCAGCAAG GTAAGGAACACCTTTTATArrTTTTA A ATPfiATATAfiATA 

GGTGCATGG 

14th partial exon (nucleotides 37353 to 37475) 

GAAACCCAGTTTAGAAATGTTGCTTTGCCATTTCAG GACATTGTTTAAGGATG 
CATCTGGAAATCTTGACAAAAATGCCAGATTTTCTGCCATCTACAGGCAAGA 

CAGCAATAAGCTATCCAATGATGACATGCTCAAGTTACTTGCAGACTTTCGG 
AA 
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1 T AC CAAGGGCAAC T C TGGCACAC C C T AAAGT C TGGAAAGGGGACAT AGC T AGT CAGGGATGAC C C GAGAAAT GAC T GGAAGC T C CAC CAGAA 

9 3 TGCAGAGCTTCCTTTGTGCTTAAATAACTGAACAAGCATCACTCTGTGTAGCAGG^ 

185 CTTATTTCTGTTTCTTTGTGATAC CAAAAC TAGCATACTCTAATTGTAGAAAATACAAAACATAGACT 

277 AAGAAATGGCATTTGTGTATGAGAATGTCTTGCTTATCTTTTCGCCCTCCAGGC^^ 

369 TCAT CGT C CAGAAGAAGAC T CAGAT C CT GAAC GAC TGTTTAC GGGAGATGC TGC TCTTC CC TTAC GAT GACTTTCAGGTAACTAAC GTTAT G 

4 61 TTTCTATCCGTAGAACCAC GTCTTTGATC TTAACAAGCAGTATTTTTC TATC 

55 3 ATTTTTTTTG7\GACACAGTCTTGCTCTGTCACGCAGGCTTGAGTGCAGTG^ 

645 AGCATTTCTCCTGCCTCAGCCTCTCAAGTAACTGGGATTACAGGCGTGTGCCACC^ 

737 GTTTTGCGACGGTGGCCAGGGTCGTCCCAAACTCCTGG 

829 GCCACTGTGCCCGGCCTAATTATGGTTTTTAAAAGATGAAAATAAGATGTTATTTAAGAAAGA 
92 1 TTCATGAGTTGATAATTTTTAATGGTATTATTTTTGCATATT^ 

1013 GTGTATCTTGAATAAAAGTGCTATACTCTGTCTGGGCTGATTTAGTGGGCACAAGT(^ 
1105 TTTTTGGCCAAGTACCTTAGAATCTTAATGATGGGCTTTGCTGGGGAGCAGTGGGAGATI^ 
1197 AGCCTGCCTTCAGAGGCCCTGCACTCTGTGTGGGCTCCCAGCAAAGCGTTCAAGGTTA^ 
1289 AATGTGGGTGCTGTTCCTTGGAAGAAATGTGGAGGACTCAGCACAGCTCCTGCTGTCT 

1381 AC TAACAAGAAGC TGTGAGGC TGT TGAGGAAT GAGAATGACATTC TTC CT C CAGGAAAC CC GGTGGTTGTTAAATGC CTTGCAGC GAGCCAC C 
1473 TTGGTCCATTTGGAGGTTCTGGTTACTTTCCTTTC 

1 5 ^fij ACACTTCTCTACCTGATGCTGTTACCATTTAAGCCCTGTCTTTTGTGAATCGAGCTGCCT^ 

1 6 S £ TTGATTCC7\AAAGTGTTACATCCATGTACAAAAAGATAAAT 
174m ATC CCATC CCACAGAACATGACTTAACTGTATACAGCTC T^ 
lQ W ATCCTTGAAGGTTTTTTCCCACCAAATTTAAGCGACTCCAGCTC 

1 9 3^j AGAGATACCTGGGCCCCAAAATGATT7VTTCTTTAGCTCTGTCCTGCATAAAAGGAAT(^ 

20 M TGCTACCAAAGTAACAGGTAAATGGGTTGAGGTCATGCCT^ 

2 1 W. GAC TAATAAATGAGAACC TCTGAATGATGCC TTTTGCCTGTGATTTGGCAACAAATGAAAAGCAAAAT^ 

2 2 &4 TGTTGACAAGATTTTCCTGTAGTGTTGTCTGAGGAAGCTAAAGGTTATCTCAA^ 

2301 ATTAAAGTAACAACTTTTTTGAGTTTGCAACCTAGAATGAAAAATTCTATTTGTATGACT^ 

23 | 3 AC GAGATACAGTTAGTTGAGTGTrCATCTTTATCC CAGGGT^CAGGTATCTGGATGT 

2 4 &i GAAAACTTCTGTGTCCCCTTTGCTTTTAACTACTCTGGTGATAGCAGGCACAAA^ 

2 5 77 GGC TAAGGTGATGAGCAAATATTATTTTC T GTTT GGGGAGAAGTTTTC CTAGAGATT TAGGAGC TTGAATTGGAGC TTTAATC C TCATCACA 

2 6 g| GGAATTGTGATGGGCCC^GTGAAGTTTGGGTACAATTATTTGTTTTCTTATAGA^ 

2 7 M GCTTTCTTGTTAAATTGTCTCAGTGATGTTATTAACTGTCTAATTAGCTGGATGAGTGAAAGGTCT^ 

2 8 H TGTGTTTTCTTAGGCAGAATAAGAGCAGAATTATTGTATTATTAGAGGCAGA 

294 5 GAAAAGTAATAC CAAGTT GGTTAGGAAAT GGCAGCAGCAAAAC GC^TGCTGAGGGGTGAITT AC TGCAC TTAAAT7\ATTTAGCAGTATAAGT 

3037 TAACTATTAAAATAATAGAACTTGGTGTCCATTT^^ 

3129 CTTGTGTACATTTTTCAAGTAGGCACATGTTGATCTTGAGCCTITACTGGTCAC^^ 

3221 TGTCAAGAAAAGATTTGC GGGTTGGATGTAGTTGC CTGAGGATGAGAGAAGAGTAGTTACT 

3313 AAAC GTGGATTTTT AAAAATCAAAAGAATAGGCCAGGC GCAC TGGC T CAT GC C TGTAAT CC CAGCAC TTTGGGAGGCC GAGGTGGGCAGAT C 

3 4 05 ACAAGGTCAGGAGTT TGAGAC CAGC C TGGC CAATAT GGTGAAACC C CATC T C TAC CAGAAAATACAAAATTAGC CAGGCATGGTGGTGCATG 

34 97 CCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCTCTTGAAACC^ 

35 89 TTCCAGCC TGGGCGACAGAGTGAGAC TCCATC TCAAAACAAACAAAAAATCAAAATAATA^ 

3 681 C TGGGCACGGTGGCTCACAC CTGTAATCC CAGCACATTGGGAGTC C GA.GGCAGGTGGATCAC CTGAGGTCAGGTGTTTGAGACCAGC CTGGC 
3773 CAACATGGCAAAACCCCGTCTCTACTAAAAATACAAAAATTAGCCAGG^ 

38 65 GCAAGAGAATCGCTTGAACCTGGGAGGTGGAGGTTGCAGTGAGCCAAGATC 

3957 ATCTCAAAAAAAAAAAGGGAATATTAATGAAGTAAAGT^^ 

4 04 9 CTAGCTGATGGCCCTTCTTTTTGCAGAATCAGATCCTGTCGCTTGGGGATCTCTGCCA^ 
4141 AGCATCTTCTCTTCTAGATCTTTCCTACCCTTTAGAGACCACTTGAAATCCCATAT 

4 233 CCCACTGTCTCAATTCCTTTCCTACTGCCTGTCTGCACCGTGC^^^ 

4 325 GGCTTCTTGCACGCAGGCATCCCGCCCCGTGCTGTGGTCCTGAGAAGGGCTGGCTTTGAG^ 

4 417 CATGCACAAAATCCCTTTCTTGCTAGGTGCTAGGGTTGAATACC^ 

45 09 TTCTTTGATTC7VTCAAGTAAATATTAATAC TGTTTGGAACATGTGATAGTCCCAGC GAC TAGATTTGTAAAAATATTTGCAGGATCAATGAT 
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4 601 TTGGTTTGGCAGAAGTAGGT AATT T C TAAAATTAAAAAAT GCAGGTAAAACAGGGAC TGGAGAGGAGTATTTTTTC CTAGT GATTAATAAAC 
4 693 CTTTATTTTTCTTATTGTTTTGTTGTCTTACCCAGTTTO 

4 785 TTTAACAAGCAGTTAGACAGAGGTAATGAC CTTTAGATTAAAAGGTTTTAGGTCAAGC 

4 877 GCCTCTGTTTAACTGAAAGTCAAGCCCAGGACGCCTGCCTTTTCCATCAAAGA 

4 969 ACGATTC TT C TT CAAGTGGAGGTC TT TTTAC CAGATGGT C C TGTTGGT GGGGACAT T GTTAAC C CTGC GATTAAC C GAC GGCAT C TT CAT C T 
50 61 GGCTTTTAAGCTCCTTGTATCCTGACTTGT 

5153 TAGTAAGAATGTTGGGAGACAATTTAAGCTACCATTCATA^ 

5245 TTGTATGAAAGGATAAAAGAGCGTTAAGGGTTACACGTCCATTAGGATAAAAAAACT 
5337 GGCATGGTGGCTCACGCCTGTAATCTCAGCACTTTGGGAGGT^ 

5 4 29 ATGGTGAAAC CC CATC T C TAT TTAAAGAATAAGGC T GGGT GT GGT GGC TCACAC CTGTAATC C TAGT GC TTTGGGAGT GT GAGGCAGGTGGA 
5521 TTGCTTGAGTTCAGGAGTTTGAGACCAGCCTGGGCAACATGGTGAAACCCC^ 

5 613 C C C GCC TGTAGT C C CAGC TAC TCAGGAGGC TGAAGCATGACAATCAC TTGAAC TTGGGAGGCAGAGGT T GCAGTGAGC C GAGATGAT GC CAC 
5705 TGCACTCCAGCCTGGGTGACAGAGAGAGACTCCGTCTCAAAAAAATTAAAAAATTAGGCT 

5797 CTTTGGGAGGCCGAGGTGGGCAGATCACGAGGTCAGGAGATTGAGAC 

5889 AAAATTAGTTGGGCATGGT GGCAGGC GC C T GTAGT C C CAGC T GCT C GGGAGGC TGAGGCAGAAGAAT GGC GTGAAC CC GGGAGGC GGAGC TG 
5981 GCAGTGAGC T GAGAT TGC GC CAC T GCAC TC CAGAC T GGGC GACAGAGC GAGAC C TGT C TCAAAAAATAAAATAAAATAAAATAAATAAAAAA 

6073 ATTAAAAAGAAAAAGAAAAGGAAACCTTAAGCCTAGTTATTGA 

6 1 6^. TTAAGC CTAATGAACAC GAGCAGT TC TAAT GT CC GT T GGAGGGGAGGTAGCATTCACAGTT CATAGAT T CAT TT AGCAATTAC T GATTGAGC 

62 5 jfc ATCTTCTGTGTGTCTAGTTATCTATGCTCTTAGGCGCTGGG 

634^ GATGGTTGCACAATTGTGTGAATGTACTTAATGCCACTGAACTGTATACTT^ 

64 4^ AATC CCAGCAC T TT GGGAGGC CGAGGCGGGT GGAT C ACC T GAGATTAGGAGT TC GAGAC CAGC C T GAT CAACAT GGAGAAACC C C GT CT C T A 

6 5 3=3y CTAAAAATACAAAATTAACC GAGCGTGGTGGCGCATGCCTGTAATCC CAGC TAC TC GGGAGGC T GAGACAGGAGAATC GC TTGAAC C CGGGT 
6 6 2grj GGC GGAGGTTGCAGTGAGCCGAGATCCCGCCATTGCACTCCAGCCTGGGCAACA 

6 7 10j TAAATT TAT GCAT ATTT TAC CAGAATAAAAAAAGGCAGTTAAGACAAGTAAGAT GC TGT GT CAT GGGGC TAGATCAAGCAC TTAGGGGT GGG 

6 8 Cm GTGTTAGGGACTTT GAAC GGGGC C TC TAC C C T GC GGGAAGGGC TGAGC TGGAGGGATC TGTGGGC C C C T GAT CAAGAAAGAAGCAGGAGC T G 

6 9 0J. TAAC CCAGC C TGGC T TT GGAACT T GAGGC T GC CCAGTGGAAT C TGTT GTGT GT GAC GGAAGGAGGCAGC TGCAC T GGATGGGAGAAACT GGA 

6 9 9f3i GGGACT C T GT GGC TGCCAGGGC CAGC TGCAGGGCACACAGC TGCAC TC TGAGGC TGGCACC TGCCTCCTT CAC TTACC CAAGGC TTTTC C TC 

7 0 £gj CTGTTTTTGTTTCCAGAC GGCCATCC TGAGAC GACAGGGTCGATACATATGC TCAACAGTGC CTGCGAAGGC GGAAGAGGAAGCACAGAGCT 
7 1 7fl= TGTTTGTTACAGAGGTAAGGCTCTTTCCTGCATTAATTTACATTTTGAAGTCAT 

7 2 &M TGTCAAGGAAGTGTGAAAAGGGTAATTGTATTTCTATGATGGAAGTTCAA^ 

7 3 CD AATGTCCACAGGGGCTATTTCTTTTTTACATTTTTATTATTTTTAAAACTT 

7 4 TTCCACAGAATTTATCTCATGGACTTAAAATAAGCAGTAACTTGTAAATGAATTC^ 
754 5 TTTTAAGGGTGACACACACATTTATGTATCATTTAT^ 

7637 GCTGATAATAAGCAGGGTCTATCGCTAGTCAATATATATTATTATATATATTGATTACTATATATATTCC 

7729 ATTATTTTTGTTTGAAAATGCAAATAAAATTATCTTATGGAAGAAAGATAAATTATTTAC 

7821 AGTCTTGCTCTGTTGTCTAAGCTAGAGTGCTGTGGAGCAATCTT^ 

7913 T CAGCC T C C CAAGTAGC T GGGAT TACAGGC GT GCAC CAC CAC GC C T GGCTAAC C TT T GT AT T T TT AGTAGGGACAGGGTTT CAGC C TGT TAG 

8005 TCAGGCTGGTCTCAGACTCCTGACCTCAAGTGATCTGCCCGCCTTGGCCT^ 

8097 CAGAAAGAGAAATTATTACAATTTAGGTTGTTTGCTTTAG 

8189 GAATAATTAGAT GTTTT AATT TT GTTTC TTT AAGT GCAC C T TCATT TAAAAATAAT GATTTTTTTTTAATC C TGGTTTTT C TAGTT GAT ATT 

8281 TAGATCATAAATATGCTCATCAATAAAATTGC TTAC TATAAGGAAGCTATAAATACTCTTATAAAGAC CAATTAAATAACAATATTTAATTT 

8373 CCATTGAGATTTTTGAAAAATTAAATATAAAATTAAAAA^ 

8 4 65 CCTTTGCC^CTATATTAGTAAACTTAATTCCAGACAAATACAGCCAGA 

8557 TGGCAAAC TATGAC C TGCAAGCAAAATC CAGC C TGT AGC CAATTT TT GTAAAT AAAGTTTT AT TGTAAAGCAGC CAAGCACATTT GTTTACA 

8 64 9 TATTGCCTATGGCTACTGTCACCATGCAACTCAAAGTTAAGTAGAGAT^ 

8741 CACAAAAAGT TTGT T GAC C T ATGTT T TAAAGCATGT GGCAAAATT AT TAAT T GC TAAC T CAGT TC TC C CAGT TGATTAAAAAAATAT GGTT T 

88 33 TTTGAGGGAGAACTCTCGATTAAGTTATTTAATCACTGCA 

8925 ACTTTTGGGACCCTTCCACTTGGGTGGAGTGAAC^TCACTTCCTCTTCA^ 

9017 AACAAAGCAGAAAAAAAC TTCAAAAACC TT TCAGT GTAAC TT CAAAATATTATT GAATT TCAC CAGTTTGAAAT GTTAATC GTATAATCAGT 

9109 CAGTCACATTCCTGTCTTTTTGAAGGTACAGTTCTCAGGATCTGGCTTTCTGATGGAAC 
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9201 GGGTGGGGGTTGCTTGCCTAATTCTGTGCTTCTGCTTT^^ 

9293 TGAATCTCTAAATACTTAGCCA^ 

9385 TGAAGATAATAAACTGTAATTATTTTCCCC^TTTTGCCT^ 

9477 TTTGGGAGGCCGAGGCGGGTGGATCACGAGGTCAGGAGTTC^ 

95 69 AAGTAGCCAGGCGTCCTGGTGTTCGCCTGTAATCCCAGCTM 

9661 AGTGAGCCAAGATTGCACCACTGCACTCCAGCCTGGGTAACA^ 

9753 CTCCATTCATAAATGCATTTATAATATACAA 

98 4 5 TAATGTCCTGCAGGAGACTTGGTGGTACAATTTCAGTTCTAGAGCTTGTTGAAAAACCG^ 

9937 ACAAGACCACGTCGTACATAGTCAGGCCTGACTACTTAGCTGTGCC7\.GTG 

10029 T CT GAAAT C T GAC TGGTT TT GGAATGAAAT TTTGAGT GT GAATGTAT GAT AC T CAAAAT GTGAAAATGCTTTGAGAAATC T AAAAAAAT CAT 

10121 TTTCAAGGATAAAATAGCATTTTAAAATATTCTT^ 

10213 GCCCTTCTCTGCGATTTATTCCACCTCTCAG^ 

10305 TTATTGTAGAGCCTGGCACAAAATAAGGACTTGCTGCTC 

10397 ACCTTCAGGCATGGCCTGTGACAC^GATAGAAAGGTTGACAGTAGAAACGA 

104 8 9 TACAGCCAGAGCCAAATAAACTAGATAAGGCATAAATACATTTATCGAC^ 

10581 TGAACTCTGCACATAAATAGTCAAGTGCTGACTGGACT^ 

10673 GACTCATTCATATTTGAAGATCTGGTGGATGTGGCATTATAATCTTAGGCTTTATGCACTATTGG 

1 0 7^5 GACACCAGGCTGGCCCATTGAATCTGTCATGTGGCGAGGTAGTTTGAA 

1 0 Bfp GAAGACTACATTAAACATTCTGTACACACACACAGA^ 

1 0 ^9 TCACTCTTTTCCTTATGAATGGAAAGAAAAACTGAGGGCAGATTTATATGGTAATATTA^ 

1 1 |gl ATACCATGAAGGCGTCTTTAAGTTCACGCATCCTATTCACTTTTATGTCATAGTAGAGGCCAC^ 
1 1 K|3 TGAAATAGTCCATTGTTAAGTAAATGTCACAAGG 

1 1 |g5 AAGTGATACTACCCCCAAGGGTAATGTTTCAGTGGTTCAAAGTCTCAAGCTTCAC^ 

1 1 |j 7 TTGATGGGAAAAAATTTTTTTGATGTTTCTATGGGTATGCCTTC^^ 

1 1 44 9 TTTTCTATTTTT AAAATC C C C CTTCAATAGT GCAT GAAAAC C TATAAC TC TGACTGGCATC TT GTGAAC TATAAATAT GAAGATTAC TCAGG 

11501 AGAGTTTCGACAGCTTCCGAAGTGAGTAAGCTATATTATACACATAGGGAA 

1 1 ?? 3 TCAT7VTGCAATCAGAGT AAT TGAGGAAAAT ATTTT TAGAT GGTTTAT GTGTATGTGGTGTAACTATTGTTTACAGGGCC TTTGATTGTAAGA 
1 lS 5 CTCAAACATGCTACTTTGGTATTGATAGGTAGTAATAAACATGTGGATGGTTTAATTATGTCC^ 

1 IrH? 7 ATT^AATAGGTGAAATTTTTTCACTACAAAATATAAAACAAACAAAAC 

1 1^-6 9 AATATTTTGAAGAAGATAATTATAAAGAGCATTTTACATTGAATAAATTTTATGTTTTT 

1 1 S? 1 AGTTAAATTTTTTTTTTATCTCCTTTAATGTATTAACra 

1 2%4 3 GTGAATCGCGGTAGCTAGGTATTGCCTTGACAGACTATCTTTTATAAAAGGTTCCATTGTGT 

12145 AATTTCTAAAGAAAAGAGATTTATTTATTTATTTTTATTTATTTAGTTTTGAGACAGAGTC TTGCTCTGTCACC CAGGC TGGAATACAATGA 

12237 (^CGATCTCTGCTCACTGCAACCTCCACCTCCCT^GTTCAAGCAACT 

12329 ACCACGCCCGGCTAATTTTGTATTTTTAGTAGAGACAGGGTTTCA 

124 21 CTGCCTCTGCCTCCCA^AGTGCTGGGATTACAAGCATTAG^ 

12513 TGTACAAGAAGCATAGCACTGGCTTCTGCTCAGCTTCTGTTGAAGACTTTTGTGCTGCTIX^ 

12 605 TGGCAC T TGT GAAAAGAGAC CAAAGAGGAGGAGGAAACTCAC TTTATAACAAC C CATTC TC TTGGGTAC TAAT C CATTT CAGCAACAAGTAA 

12 697 TCCCATCTTGCCAGGAACT^GAATTGACTCATTACTGTGAGAACAGC^ 

1278 9 ATTATGCAACAAC GGGGAC CAAATTTGAATC TGAGTTTTGGTGGGAATAAAAAC CATATCCAAACCATAGCAGTTGGCAAATTGAATTATAC 

12881 TTGATTTTGGGAAAATTGAAAGGAAATAGTGATGGATT^ 

12973 TATTTTTCTTTGCGTTTGTTACATCTTTGA^ 

130 65 TTGCCACTCCTTAAGAATGTTCGGCCAATTCCCCGATTGCCTCTTTTTAAACCTCAGCCAGGA^ 

13157 ATGGCTAGCCCTTTACTTCTATATTTACCCA^ 

132 49 CCTCTTGTTCATAGTCTCAGGGTTGAAAGATATCAGACTATGTCATGTCGTATACTTACTATC 

13341 GGC7VTTAATGAGAATTTCCAAATGTGTGATGAG 

134 33 GACTCTGCCGTCTGT(^GCCTGATTTTCCTCATCTGTTA^ 

13525 AGAAGTGCGCAGGGCCTAX5GACGTCATACCGATTGAGTAAATGTAAGCTGCTT 

13 617 TGATTGTTAATCGGTTTCPACGGAACAACTCAATCTTCCTTTTT^ 

13709 TTAAAAACTAATCCACTTTCTTATTTTTAGTATGCCTGCTAACTCCCCAGAAGCTATGCTC 
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AC T GAAGT C T CTT GGCTTAC C CAC C T TGAAAAGCAAGGGCATAGAT GGTTTTATTC T TT CTC T GAATAAAGAAGC T GGGC CAT C TTT GGATT 
TAGTAAAGGCCGGGCCCTATGATGGAGGAAGAAATGCAAAGCCT^ 

AGCAAAGTGGTCAAGTT GGATAAAC TTC CAGTTCAT GTC TAT GAAGTT GAC GAGGAGGT C GACAAAGATGAGGT GGGATAC CT GC TT GC TGT 

TGCTTCTCTTTTCACTCTAGATTTAAACATCAACT 

GAGTCTTACATGAAACGGGAA.TAATTTTTTAAAAATGTTATTGATTGA 

GTTCTCTTGGTTGTGTTTATTTGTTCTGAGATCATTTTC^^ 

ACCTTGAATTTAGGTTGTTTTTCAACAGATC^^ 

AGTCTGATTTGATCAAGCCC^CTCCCCCTATTCCTAGAGGAAAGCTCATGGCTA^ 
TGCCCACTGACTGAGGGAAACCTGACAAGAAG2\CAAT 

TTTAGGTTTTTC CTTCTTTAAAAAAATTGTCAGCTGAGC TATAACATTAGC CACTCATTAAGCAATGTGCATGTAGCAAATTATTTTTATTC 
CCCCCATGACTTTATCTCTCCTTTCTGTATTGCCTCAATTTCCT 

CAGGAATGTGCAT GTGC TTTTGTCCTCT GAC TATAGGGGAGTGTCATTTGAAAACAT TT TTT C GTGAAAC CAGGCAAGAC C TTC CAACGTGA 
GTGGTCAGTTGAGGTATGTCCTTTTTGGTCCTTTTGTGGCTCACT 

T GGAAT AAAGTT T T CAGAAAT GTAGGC GGGT C TC T C T CTTT CAC C GC TAAGT GGAC TTTTAT GTGAC T TGTAGGCATT GGT GT CAATGGGT G 

CTTCAGTAAAGGGGCAATGGACAAC TTGGCACAAAGGGAATGACCTTC CCATTGAC CAAAC TCACAGCAAGCAAC CCAGGTAATAAC GGGAG 

GTTGACTGCTGGTGTTTTCCTTCTCTCCTAGGATGCTGCCTCCCTTGGTTCCCAGAAGGGTGGG^ 

GCAACATGAACAGTGCCATCAGC GTGAC CATGAGOT 

CGAGAACAATGCTCAAGATGAATG£!GCATG^TCTTC 

CCCATTGGCAmGCCTCATCCACCCACTT^^ 

ACAC TGACATAGGC T TT C TTC TC TGTAAT AAC TGAAAAGTGC TCT TT GGTAC C T CACAGAAT GT CAC CAAGGGGC TAT C T GTCAT GC CAATC 

C T GAGCAC TTC TGT GGAGGT GTAC TGCAGCAAAGTGAAGT AAAGCAAAAATT GAGGAC GAGAAAAGAAAATAGT T GCAT AGAAGAGAAGGTT 

GCAGACAGAGAAGT CAAAC CAATAGAAGAAGC TAT T CAGGAGAAAAGT GGGAC CAGAGGAACAT CAGGATTAATAACAAAGGGAAGAGAAAC 

AAGGGAGTCAGGGAGATAAAAATTAAGGAGGIAAATGTGACTGTCATTACCCTAAGGCTG 

TTCCC^TTCTGTGAGCAAGAAACCCTGGGGATTTTAGAGAAAGTTTCTGTC 

TTGGGTAGTTATGTAAAATCTCTGATTCCGTO^^ 

AGCTTCTTAGATGGGATGGGGAGTCTTGACCCC^^ 

GTCATAAGT GGGTTATTGATAGAGAT TGT GAC C C TC TTCAT TTTGAAAGAAT AGCT GTGTGT ATATTT TT CT C TCAGTCAT TTAAGAGAC GA 
TTTTTC CAC C TGATT CAAC T TGGC GATGGAT C C TATAATTT GAATT TTTATAAAGAT GAAAAGAT CTC CAAAGAAC CAAAAGGATCAATATT 
TCTGGATTCCTGTATGGGTGTCGTTCAGGTAAATATGAAAAGAGTTTTAC CATTATGTTTTC TTATC TGCAGTAGTGCTTATGTGTAAATTA 
GCAGATTTAAGGAAACACTTCCAAAAATGGCAATATGCATGGTAGA^ 

TGT AGAAGAT GGAAGGGAT AATGT AGAGGCAGAATTATGC T GTGGCAGGCAGGAGCAC TCT GGC T C GGC CAC T TTATAGC T GC GT GAC CTTT 
AACAGGCTACTTAATTCAGATAATGAGAATGTTTCTTTAATACGG 

TTCATAGTATCT CAGTGGT GATTT T TATC GC TAGCAT TGTAGTAC CAGTGGC GGTGT AGAT CAGTAAAGAGATTAGGTTT CAGC GCAGATTG 

AGTTCAAATCCCTGCTCCTCC^CTTACCAACTGTGTAACCTTGGAGATGTTATTTAACCTCTC 

AGGATAATGGCAGTACCAAATATGGTTACTGAGAGGGTTCATT 

TGGTCAGTATTAGATAGTTTTGTTATCAT^^ 

AAGATAAC^CATGAATGTATTTTTCTTGTAAAAAA^ 

CTTTTCTGAGGGGAGCTACTACCAGTTTTGCATGCATCCTTCTGTAGCTTTTTAGCATTCT 
ATCATCTGTCCATCCATCCATCC^TCCATCCATCTGTC^ 

TTATAAGCATACACAGGACATGGGATAGCATAAATTTAAAGTGGGGGTGAAAGC^^ 

AATAGAGTTAAGGAGAAAGCGTATGTTTTGAAGATC TAACACCTGCTGTGGGTGGGCCACCAC C TGGGCTCTATGCTTTCTCACTTGGAGAC 
C TGTTTAGTCAC GCAATT CACAGT GCACAT GAGATAAAGGCAT GATGC CT GT T TAGT C GAC TC T AGAAGCAC C C C T GAC TTTAAAAAGAAGT 
TAAAGCAAAACTAAATGTATTTGGCAACCTCATTTTTT7U\AGTAGGAAGTAATTATTTTTGTTTTOT 

GC C CAGGGACAGATGTTT ATAAGTACAAC T GC C C TGAGC TAT CAATT AGTC T C C GGGGTGCAT T T CAAAATC TAAGGT TCT GAC T TCAAT GG 

AAGTCTCTTCCTTGAAATTGTCTTTGCAGATGCAGCTGATGGTGTTTTCATTTAATAAAGT 

TGTTTTTATCTGTGTCTGTTTGTAAACTAAGC7VTCAAAAGTTGTGATTTAAGTGTTTTTAAAAATTA 

TAGTTGACTGGTGCTGTGAATTAAAAAAAGTGCCTAAACTAAAAAATTTTGAAGCATTTTA 

AGATGAGAAAACTGAGGCTCAGAAACAGAAATTTAGAATTGAGGCCT 

GTAT GCAGGGAAAGGAGGAGAGAAAATT CAT TTT GTT TCAAGC CTT T GAC T T C T TC C C TGGTT C TTGC C TTGAAGTTTAAGTGGAAT CCAAA 
GTGGCAATTACTGAGCCC7\.CAGCAGACAGTCTGTGCACAAGAGTGT 
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18 4 01 TGCTGCAGAATCACAGAGAC TTTT GGGAGGGTTGC C C TGTC C C TGAGACC TC CACCAAGGAAC T CTTAGAGAGAGTGTGGATAAC C CAGT AG 

184 93 GATTTTAGTGGC TAT GC GGGGGGC TGTC C T GC TGGC T GAGGT TAGTGGGAGTGTTTGATTTCATATC GC TCAGCC TGTC CTTAGAGGGGATC 

185 85 TTGTGCCATGATCCTCAGAGCTGAACCTCTGTCTACTGCGGCGAACCTGGGGAGA^ 

18 677 CTGGCATCTATTGGCTTAAGGCCATAAATTTCGCTAAAC7VTTGTACAATGCA 
18769 TCAGTAGTACCGAGATTGAGAAATCCTGGCCTAGTGCATGTTCATCTTCCGTCTGT 
188 61 AGTCACCCTCTTGCAGGCATGAAAACTGGAGGCATGAGGCAAGGCCACGGA^ 
18953 AAGTTCCAGATAAGTAAGCCAAACATAGTAATA^ 

19045 CTGAAGAAATATGAAAATTTTA^TGAAATTTCATTTAT^ 

19137 GAGACTAAGGATGTGTAGCACTGACAAAAATGGGAGCACTGAGGAGGTOT 

19229 AAGCAGGTAATGAGTCAGAAGGAAAAATAAAGCTTGCAGAAAGTGAAGCAGGGA 

19321 ATCAGGAGGGGGCAAGCTATTTAAAATAGCTTGGTTCTAAAT^ 

19413 TTCTAATTCTTTTAAAGTATGTCTGTTTATCCTTTTTTCATTTCAGAACAACAAT^ 

195 05 AAAGTAGTTATC TC TTGGCAGCAGACAGTGAAGTGGAAATGGAAGAATG 

19597 GCAATGCAAGAAAAGCGAAATGGCGACTCTCACGAAGCTAGATAGGCTTGGC^ 

1968 9 TTGC CAGGTGGGTATAAGAAGGAGAC CTGTGTTACACAAGTACATGAGA^ TCTTTACAAATATCCTGTCAGGAAA 
19781 GGT TTT GTCACATT ATAAC T TAG TT C CC T GACAT TT C GTATAT GGAAATCATGTAAT GGGAAGAACCAAAGC TTTGGAGGCAGAAAGGGAGA 

19873 C C TGGGTT T GAGT GC CATAAATAC TGTAT TT CAGC TGTGTAGC CC TGGCTAAACAAC TTAT GTTTTCTGAGC CTCAGTTGACTCAC C TATAA 

1 9 %4p AATGGGAATAAACATGAAAATTGCTGGGAAGATGGGAAGTGTAAATAAGAAAATG^ 
2 0 0^7 AAAATATTAGTAATAATTAGAATGCATGGGAGCCTCAGATTA 

2 0 lffp AACCCTTAGGTGATTCCCAACCCTGGCTTCCCCTTAGAAGTACCTGGGAGCTTTTTA7W\ATACC^^ 

2 0 2jj. ATTTAGTT GGTC T GGGGT GGAGC C TGGGCAGGTC T GACTTTTAGGGGGTC T CAT GGAC GTGTC CATGT GGGCTGTTGTTCATAGCTACTGTC 

2 0 33p AGTTCTAATTGGACGGTGTCCATGCTATACCAGCTGCTCAGTGTTTTGACTTTCOT 

2 0 4g5 CCAAGTGTTTCCCAGGAGCATCCAGAGTGGGGAACCACTGTGTTCATT^ 

2 0 5J7 GAAATGCTATGAATTACAAGTGATTCCGATAATCTGTTTTGCCATTTTAGATGATG^ 

2 0 6Q9 ATAGCTACCTGCCGGAACTTGCCAAGGTAACATCGTCTTATATCT 

20701 TGTCCAGAATGGAATCTGTTGAAATCATAAAAATTACACT 

20%93 CAGTATGATTTCAGGTCTGTCTTCTAAAAAGTACCCTACAAAGCATTCTCCTTTTATTATTAT^ 

2 0 £g 5 AACACTGGATCACAGGTACTGAAGAAATAACTGAAATATGCAG 

2 0 M? 7 C TC C CTTAC TCAGGGTAAT AGACAC GGTTC CAAAGAGGAAGGACC TGGTAAT C TTGC CACGAAAC CC GGGGGTTGC CTGAGTTACAGAAATT 

2 1 W 9 GTTTCGGGTCACTCTTACTGGAAAAAAAATAAGCTATTCCTGTGTCTTACA^ 

2 1 GCAAATCAACCATACTGCTACTTCCCAGATTAAATATCTATTATGATGTTGCCTTTT^ 
2 1 24 3 TTATAGTTGAAGTTCTTTTTAAACACTGT^^ 

2134 5 ACTTTC^TTTTTATATCTTTACAAATAAATCTATCATAATGATATATACAGTATGATTTA^ 

214 37 TACCATATGTATTCTGCACTTTAAAAAATTTTTAAAT^ 

21529 CATTTTTTTCTATTATAAACAAAGCTACAATGACTGT^^ 

21621 TATCTTTGTCTTCTCTAGCCCTTTGCACTCTTACTCTGTTACTGCCCTTCTATTCTTTTTTG^ 

21713 CCACATCTTAAACACTATAATAGAAACATGGTTTATCTATATAGGAT 

218 97 TAGCACCTGCCTATTAAAGCTTATTTTAGTGCCACCATTGTAAACCACCTCCTAGTCTGGG^ 

2198 9 ATGTCTGTGTATCATATTTTGCATTGAGATTTGCTTTTTTGTTTCTGGATGTTTGGGG^ 

22081 ATTTGGGTTTTAGTTTGTTGCAGCAGGTAATAT^^ 

22173 GATACCTAGGAGAAGATTCATGTTATTTGAGCCTAATGTTGATTAATTAAAAATCTATGCTTTTTCCTATGAG^ 

222 65 CTCCTTCTCTTACTACCCAGCCAATATAATTCAGTATTGTTTGATCCCAAGACCTAGGGAGAT^ 

22357 ATGTATACATTTATGTATATATACTTTTT^^^ 

2244 9 CAAGTTAGGTTAAGC CATTT TAGTT GGT GAAATCAGTTT GATTTCAAC CC C T GC TT C TTTTT GTTTT GTT TC TATT TAGTTTTTTATTC TTT 

225 41 TTATTGAGGTATAATTTGCAATAGCAGAATGCTCAAACATGAATTGTAGAGC 

22 633 CAGGCTAGCTTGTACCTGAGACCCTCTTTATTTTGACCTCCTVTCACCGTAGATTAGTTTTG^ 
22725 TAGATGTACTCTTTGTGTCAGGCTTATTTAGCTAAACATGTGATTCACTTTA 

22 817 AATCCCAGCACTTTGGGAGGCTGAGGTGAGCGGATCTTTTGAGGTTAGGAGT^ 

22909 TACTAGAAATACAAAAATTAGCTAGGCGTGGTGGCAGGTGCCTCTAATCCCAGCTACTTG 
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23001 GAAGGCAGAGATT GCAGT GAGCTGAGAT CAT GCCAC T GCAC T C TAGC C TGGGTGACAGAGCAAGACT GT GTC T GAAAAAAAAAAAAAGGGTC 

23093 cgttttaatgaaataaaatggaatggagaatatgaaagtacactgcc c ttaataatgacattattttttatataaaatactgtcattattat 
23185 tttgctggcacctgccticcatgcctagctaatttt^ 

23277 CTAACCTCAAAAGATCCACC(^CTCAGCCTCCCAAACTGCTG?^ 
23369 AGTGAATTACATAATTTTTTACATAAAATAATCT 
234 61 GATATAAAAGATATTTGTTTTTCTTACTGTGCm 

23553 GATACTTTGATATCTCTATACAATGTGTGATGATCAAATCAGGATAATTGGGATAf^ 
23645 TCW\ACATTTCTTACCAGGAGT<^T^ 
23737 TACAAATACTTTTATTTAGGAAGGTAGAAAGGTGG^ 
23829 GCACATGGGTATCTGTGGGCTTTGCCTTTTAAAGCCCAGCTTGGTGT 

23921 C C C CAGC C TT GAGCAAAAAT GCAGTT TT GGC T GT T TATCAT C C CC C TTCAGGGTGTC T GAAC TAT TT GCACC GGTT GAGAAGGCAAAGAAGT 
TGACCTGATAACTGTTGGT(^TCCCATTAGGAAGGATGGATTCCATGGTTA(^ 
AGAGAACTGTGCAAGGAGTTTACCCAGGGTATGAAGAGGTAAAGAGGTCT^ 
CTGGGCCACAGTTCAGTAAGATTACAAAGAAGGGCCTAGAACA 
24 289 TTTTCTAATCAGCTCCTC^ 

24381 AC^GGAAGTTCTTTCAGTGCAAAAATAACTGATGTCTCATTCTGGAATATTATGG^ 
24473 AGGTTTTCAGAATATTTTGTATAATCTTGGAGCTTAT^ 
2 4 L J 5 ^^TTACCCTAAAACTGGTTCTTTTCCTCCTAra 

2 4 5f 7 TATAAATATTTTAGTTTGATACACAAACCGTACTTCAAGTTAGTTAATTGA 

2 4gfi 9 AAAC TT CATTAC T AGTTATT TAATAATTT ATT TC T C TGGT TACATTTATAT C T TAGGT GACATCAGAACATATATGTGAC C TC C T TAAAGAT 

2 ^Gif 1 AGTGTGAAGAAAACCCACCTTATGTTTTCTTCCACAGCTTTTCTGTTTGrrGAGCTTTTATTTTTGT^ 

24^33 CTTTGGTTTCCCCATCTGGTTCTGAAAGAGAAGTAGAATTTCTTC 

2 5gg 5 GCAAATGAACTTATTGTTCCAGGTAAATCTTCCACAGTTGCATGCAGGGGAT^ 

2 7 TGAGGATCAAGGGC CATGCTCAGCAGAACTTGTCAGGAC CCAGCAGTTTCAC GGACAC CTTTTCTTAATTTTTAAACCAAGTCTATAATAAG 

2 S2p 9 TGCTTTCTTCCCTAGATTCCAATCCAGAAAAC7\ATATC^ 

2 $3 0 1 TGTTGCATTTCTGGGCCTCCTTTGTGAAGAGGATGAACTGATGGTC 

2 5^9 3 CTCTTTAAATTAAAGATTATATATTTTGGCCTCAAAACATTTTGCAAACTCCTCCTATTCC^ 
2 £|8 5 ACACATTGTTTCTGATAATTCATCCTCAGAATAAGATGCTGTTGGCCATAATCTTTGTCTC 
2 S#7 7 AAGACACAGAGTATGCCTAAAGCCTACAGCAGACTO^ 

2 y ^ 9 CTTGTCAGGGTGTGTCATTAGTGCTTGAATGTAGGGTTTACAGGATGGGGTGGGGTGGGGGAATC 
2^61 TCCTTGTCTTCTCAGCAAACACCAGTTTCTACAGAGAACAGCTCTGCCATTGTG^ 
2 &S : 5 3 CGAAACCCAGAGTAGTCAGTGGGCTTTGGGCAGGAAAGTGGCAAC^ 

AGACTGAC CACAGGTTATTTTAAGAGCAGAGCTGGTTTC CATCACTC TGAGAAGTGC TCAACTACAGAC TTTGGGATGATATTTGTTATAGC 
TGTATTTTCTCCACTCTTAGATTGTGAAAGTACATATTACAAGTATTTATTTTATTATCTTTACTA 
TGCCGCAATAAGTAAAAATACCC7W^GTTGTTTAAAGAA7\AGTTCACCTTTTCCCTTCA 
TTAATAACCGGTTGCAATTCCCCTTTCACCAAACTGATTGCTCATAGAAATATAGAT 
AATATGTTCATGATATATACATTATTCTGAAATTTTC 

TC CCCCTTACATACATATCAGCAAGC TGCATGATATTTCAAC TATTGCTAC TGCACAGTTTATTCAGC CATCTCCC TATTAATGAAGATTTA 
2 64 97 AGGTTTTTTTTTCAGTTTTTAGCCTCTACAAACAGTACACAATA7\AC^ 

265 89 C C C CAAT GTGGAATTTT TAAGTCAAAGTTTATTGT GTTT TTAATGC TT TAAACATT GC CAGGTTACC GT C CCAAAAGGC TATAACAATTCAC 
2 6681 ATTTCTGTTTCTCTGCATCTTCACCAGACGAGTGTAAAATGGTOT 

2 6773 CATATTTTCATATATTTATTTGC CATTTGTGTTTTTTCTTCCTGAC TTGCTTGTTCACATTGTTTAC CTTGTTTTCTTC TGTCTTGTGTAGT 
268 65 GTAATAGTTTAGAC TCTGAAGCCAGGCAACC TGAGTTAGAAGC CAGGC CTCTATTTCATGATGTAGGTC TTTGGGCAAAGTAC CTAACATTC 
ATGCCTTAGTGTTTTCTC TTTTAATGAGCAGGGATAATAATAGTAC C TGC C TC C TAAGGTTGTATAAAATTAAAATGGGCACTTAGGGTAAT 
ATCTAGCAGGTAGATATTGGCTATTATCAATAGTAGCTCTTATCGTTACTATTCTTCCTV^ 
CTACCCCTGAACCACATTTTTCTACCTCTTAGM 

TAGACCTTAGGAGGGATGGGAGGAACTTTAAGAAGCCGAATTTGCTTTTTATTT7\TCTT 
ACAAGCTTTTAGGTTATTTTTGCATCTAAAGCTGTCCCTTCTTTTC 
27417 GACAAAATCAGAATGCTTTGTGTCTATTTTGGCTAGTAGTTAATTGTTTTTCTTTTATTGTGTCTGC^ 
27509 CGAGCTCATTAGCAGTTATTCTTGCTTTATTCATTTCTTATCTCCTAGC^ 
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27601 GAATTGCAT C TACATTAAGAACC CAC TC C T C T GC T C TAGC GTC TGAGAACATAACACAGTATTTGCC TTTGTTGAAGGGC TCAAGC GGAC GA 

27693 TTCAGAAGTAGAATAGATAACTGTTGGTGGTGGCTTGCAGCACCTAGTAC 

27785 AAGCAAGTT C TT T GGGTTCAGTTAAATAGATT C CAC TTGCAC GTGTTC TCAC T C TTTTGGTGTGAAAAATTAGGAAAGGTGATAGGCAGGAT 

27877 AGAATAAAATGAAGTGGTCCCATCTCCCATATGGAGAGCGCCTGCCTCm^ 

27969 GCAGGAACGCCCCTTCACCTCCATAGCCTTCAGGC^ 

28061 GTTTAACATTTTTACCTGGAAGTGAAAATGGAATC CAGAGAGGCTAAGTAGC TTGCACAGCTACTTAATTGAACTAGAACCAGAACCCAGTT 

28153 TCTTAGTATTCTGGTGCCCATTTTATATAAACATAGACAGCTGATCATGGTAGTTTG 

28245 TATGGC TAAGAAGATTAAGATGC T TATTTTC TC GTTATC CATGATTAGAAAT GTAATAAGAGAC TTTGTTCTGTCACCAACAGAAGAGACAA 

28337 GACGCCTTTTCTGCTGTGGTTGTGAGTGCCCCA^ 

28429 TATT CC GAC T C CATAGAC TTAAC TGT GGAGAC TGAGAGT AGTGGAGAGCTGAGACTGAC7\AG CAG 

28521 GCATGAAAAATCATAAAGATGGAGATGACTTCTATCTTGTGAATGTCTGAAAAGC^ 

28613 CCCTCATCTCTGCCACCATCTGTCGGTC 

28705 CTCACTTTCACCCTCTCCCTC^GGACATCT^ 

28797 TTAGCTGTCTATTGGATGTTTCCAGTTGGAAA 

28889 TTT GGCAGATTT GT TTT TC C T GAGTTTTC TGT CGTGTTCATGGAT T CACCATTCAGCAGGC TGC C CAAGC TAGAAATGTGGGATTTGTTAAT 

28981 CGGCCCTGCTGTGAGACTGGGAGGCTGGTGTGATAACCCAAGAAAGAC^ 

29073 CCCTGCCCGAGTGATGCTTCTGAAATGTGGAACTCATTTTATTCTCTTACCT^ 

2 9 1,6 5 AATCCTTTTTTTTTTTTTTTAAGTGAAAGCAAGTTTATTAAGA^ 

2 9 zSp GCTGCTGGTTGCCGATTTTTATGTTTTTTTCTT^ 

2 9 ^9 CATTTCTTTTCACTTTTTTTTTTTAATCA^ 

2 9 1 ACCTGCTGTGCCTTCTCCTAGAATGCGCTTTTCTCCTTTCCTGGCCAAGTGT^ 

2 9 CJp CTGACGACCACCCCCTAGTCCAAGTO\GCTCC^ 

2 5 CTGGTGTTCCTTGGGACTAGGGACTTCCTTCATTCACATTTCACATAACTTGAGQ 

2 9 1| 7 AATTAATTAGCATCTTGTCCTTCAAGATCAGCC^^ 

2 9ag 9 AGTCACAGACTGGACTCTATTAAATCCTGTCTATCATCTGG^ 

29901 GAT GTT CAGC TC GATTCT GC C C C TTCAT TC CAAGC C T GT C C CATATT C CATTAC TTTATGAAGC C TTTC TTGACAGACAGATGC TTAATTAT 

2 9 Q£ 3 TCTCTTTTGCTTTCTTTGTGTTGACTTTGACTCTGCCACTGGTTGTGAGCTTG 
30Q|5 AGTGCTTTCTTTGTGTGCTGCACACTCCCTGGCACACACAGCGGCTCTCCAACA 

3 0 p? 7 GGGCAGCTTTTAAATGTATTTGGGCACCTTTGC^G 

3 0 M 9 GCAGAAATCAAAC T GAAAAGTGAAAGCAGAGT CAAAC TTTT TTATTTGGAC C CAGAT GC CCAGGTAAGAACTATC TAAAT GTTTAATATTTA 

3 0|1 1 AAACCAAATGTGGGAGAGAAAATCATCGATGGGCTTATTTGTTTATTTGTTTGC 

3 0© 3 AAATTTGGGGAAAAGAGGAAAAATAAAAATGTATAATCTTATC^ 

3054 5 ATTGGGAGAGATTTATTGAAAGACATTC TCAAGTTCAGAAAGAACAT C TAATTTAC C TGTTAAAATAAC CATCAGAAAACAACAGGTATGAC 

30637 TGCAGTTGCCTGGGAGTGAGTGATAATTCCCGACTAGCCCAGGCTC^ 

30729 TTTCTGTTTCCTCACAAACCTCATAACCATACGT^ 

30821 GTAAAATATACATAATATAAAATTTACCATTTTAACCATTTTTAAGTGTACGGTTCAG 

30913 TTACCACCATCCATTTCCAGAACTTCTTCATTTTCCCACA 

31005 AGTAACCTCTGTTCTACTCTGTGAACCTGCCTATTTTAGGAACCTCATAAATGTGGAATCATACAGTA^ 

31097 AAACTTAAC^TGTTTTCAAGGTCAATCCATGTTGTAGCATGTGTC^ 

31189 CTACATTTTATATATCCTTGTAAATCTGTTGATGGACACTTGGT^ 

31281 AGCTCTGTATTTTTTCAGTTCATCCATTGMTAGGTATACCATCATGTCTTTTTTTTTTTGTCTTTTT^ 

31373 TCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGTGCAATCTCGGCTCACTGCAAGCTCCGCCTCCTGOT 

31465 CCTCAGCCTCCCGAGTAGCTGGGACCACAGGTGCCCACTACCACACCTGGCTAATTTTTT^ 

31557 TTAGCGAGGATGGTCTCGATCTCCTGACCTGGTGAGCCGCC^ 

3164 9 GCCCATGTCTTTGACCATTGTTATAAACTATGTGTGTAACTACT^^ 

31741 TAAGTGTATATAGCTTTTCCATATTTTATTCCCTTTC^ 

31833 TTACATTTTCTTTTAAAGAAGCTTGACTTCTCATCAGCTGAGC CAGAAGTGAAGTCATTTGAAGAGAAGTTTGGAAAAAGGATC CTTGTCAA 

31925 GTGCAATGATTTAT C TTTCAATTTGCAAT GC TGTGTTGC C GAAAATGAAGAAGGAC C CACTACAAAT GTAATTTTTCATTTTAAAAATAAAC 

32017 ATTAAAAAAAAAATAGGCAGAGGTTTCAGATGTACCTTTACAGTGC^ 

32109 TTTGGATCAGGATATGGAGGTTGTTAGCCTGCAAGGA(^GGATGTTCGTGAT 
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32201 GACAGAGTTCATC TTAGATGC TTCAGGGAACGCAAC TGTGTTTTC TATGGAACATACATTAC GTAGCAAAACACATAAAGGTAAAATAATTG 

32293 TTTTGTTTGTTTCCTAAAATGTTTTATAAGCTAATTCTCTGT^^ 

32385 TAGGTCTTGTCATTTCTCTTTTGATAGCAACATC^ 

32477 AC C TTTC TTT GTTAC TC TATC CC T GT TTGACATAAAATACAAC C GGAAGATT TC TGC C GAT TT C CAC GTAGAC CTGAAC CATTT C TCAGT GA 

325 69 GGCAAATGCTCGCGACCACGTCCCCGGCGCTGATGAATGGCAGTGGGCAGAGCC^ 

32661 CAGTATCCGAAGCAGGTGGGGAGTATGAGCCCAGCAT^ 

32753 TTAGACCTTGTAATGCACAAGTGGGGTCATTAG^^ 

32845 TTTAATTTC C TTTCTTCTTTCTAAAAGTTCTC CTAGTTAC CTC CCTCACCAAATAGACTTTTTGAGCAGATGATGAACTGTTTGTCAGCTA^ 

32937 CTAGTTTGGCACTGGGTGCTTTTTACTAGTTGTCCTGTTTCACTGTTCTTTGCTCT 

33029 TTCTGCTTACTGAAGAAAATTGTTTGCCTCCTAGGGAATATTTTC 

33121 AAGTCCTTCAGGGGAGCATCACACATTGCGCTGAGCCATATATGAAAAGTTCAGACTC 

33213 GGTAAAAAGCAATCTGAAAAGAGGCCTTTATGTGAT^ 

33305 GAATATGCTACCTGTATTTACTCTGAACTTTATGTCTTGATTTTGAAGGAAATGTAGTCCTATGTCT 

33397 CAGTTCTGAGATCCTAAAATAATTTGCAAAGGAATTACCAGCTTAAT^ 

334 89 AGTTTCTTCTATCTGTTAATTTGCCTCTTGGGTTCTG^ 

33581 CCTTGGCTCAGATGACCTGGCCAGCAGTGTCAGTGATTCAGGTCCACTTGGTT^ 

33 673 GAAAACATACATGATGTATGTTTGGTTTTTTTCAAAGTT^GTGTTCATTACT 

3 3 J§ 5 TATTAGAGGTGATCTAGTCCAAGATGTGCTTTCATATTT 

3 3 $5 7 GAGC TGC CAGGGGGCAC TTGACT GC CAC T TC GTAGCAC C TT GTGC TAC CT GGTTAGT GT AAT C T GTTAAGTGC TATTAT C C TTGC CAGTTTT 

3 3 p 9 ACATATTTTTAGTTATTTAAAAAAAAAATTGTGTGGTCATCTGAGAGGCCTGAA 

3 4 &£ 1 CCCAGAGGCTGTGAGTGAAGTTTACTAAGTTGGATTCAGAGCTTCCTATCTTCACCTCTATGG^ 

3 4 R 3 GATGCACGATGGCCATTGAGAAATGGATTTTGGAGTCAGAAGA^ 

3 4g| 5 CACCTCTCATGGCCTC(^TATGTTCCTTCTGTGCATGAAGGATGATGTTACTTCTTGCCTCTGC 

3 4|| 7 AACAGATCATCTATG^ 

3 4^0 9 GC C CATT TAC C CAGGCACATT GGTT C CAACAGTAAGC CTTT TTGGC T GATGAAAGC T GTGTAAAGTTTGGTCT C TGGAGAGAAGC T GTTTTA 

3 4£ 0 1 TT TTTTT AAACCAAGTC TGTAAAAC C TT GGAT GAGAAGC T C T T TTAGC TC TT TTAT GTTTTGATCAAT AATCAAT GAAGGC C CAATATAAGA 

3 4££ 3 TCTCCTCCCCCGACCGTGTATGCAACACATTTCCAAGGCCCATCCACAGCAACTTTGTTA^ 

GGCAGC T CATATT GGTGT AAAAAT CACATATCAC T GTAGGC TAAAC TTAC C T C T GCACACTC C TC CAT GTCCAC TGAGCATCT GC T GAAGTC 
TGCTTTTTCTTCMTTTTTTATGGA^ 

3 ^ 9 TAAAGCAGGAATT AAGGC TCAAC TATCTTAC TTTAGCACAGTTTTGGCAGAGAT GT TACAGT GAGAT GATTTT TTTCT GT C TGT CAAAGTTG 

3 ^ 1 TTTCTTCATGTTTTCCAAGATGGTCTAGAACATCATTTAGAGTA 

3 5335 3 CCTGTGAATAGAGGTTTTAAAAAGAAAAAGAAGGGGAAAAAAGCCCAA^^ 

TTCTATTTGCTCCCTTCATATGTCCGAGAGCTAAGTCCTCATTCACTGCAGAAAAGGCTTAT 
TGAAATTACTGCATTTTACTCCACAACATATTCATCATTGTTAGAACCAAA 
GCTAGGTAGGCCATTGTACCCTATAACTCATACACCTAAGACCCCACTT^ 
35 421 GGGGGGAGGTCCTAGATGATTCCCTGACATCTCTTCCAATTAGGATGTATTTTCCAGGTTACTC^ 

35513 TAT GGTAGGATTATAAAGT TT TGCAACT GAAAGGGAT CAT GAAAAGTATAGC C TAGC C C C CAT TTTCAGGTGT GGATAGT GGAGC TCAGAGA 
GGTT TGATGGCC TACAGAGTCAT ACAGC T GGATTCAGTGTGAC TTAGAGACAGACAGC T C TGT CATC TAGATC C TTCTGGATC C T TC TAGAT 
CCTTCCTGTGGCTGATGGTACACACAGATCACCGGAGGTCTTGTC^ 
35789 TGCATTTCTACAGTGATGCTGATGCTATAGCACACTTTGGTTT^ 

35 881 GACCCTTCAGTAAAAAAAAAAAAAAAAAAATTAAGTAATGTAAAAGACCCTTCACTGTA 

AAGCAC TAT CACAGTACAGACAAATTTGGAAAAAT TAAGCAT TAT C T TTTTTAGCAAT AGGGGT ATAGAT GATTAATAATAGGTC TGCC TTA 
GGC TAAAAGCAAATAAGC TT CAT T GT ACAC T GTGAAT TAAT C AAT CAATCAACAACAAAC CAAAATTGAT CAT TAGGAAGTAGAAATAGCAG 
TAATTCATGTTTAGAAAGCAAGAATATAGCATTGAGAACCCC^ 

TCAGCAT AAGCTAGTGC GC TTTAGGAGC TGTGAAAGCTTAGTATTTTAATTAGTGTTCT(^ 
ATGGATACCAAATAGTAATTATTAATAACTCAGTAGACTTATAATAAGTAGCACTTAGTCAT 
GTTTGTTGCCCGTAAAGACCAAAGAAGAGACATGGATTTTGGAAAGTCTTCGTTCTGTATTCT 

ATAAAGTAC TCT GCACAGAGT GT GAATC CAGC CAT AT TCAT T ACT GTATGAT GT GC TT TTC TAT GCATC CAATGC TATGGTAAGGAC TTATT 
TAGTGAGCATGTAAGGATGGCTCTTGAGGTCACAGTTCTTTCAGTGAGATGCAGTATCTCATAGCT^ 
AGAAGGTGCTGAAGAATGCCAAGCAGGCATGCCAAAGACTAGGA 



34gf85 
34331 



35145 
35237 
35329 



35605 
35697 



35973 
36065 
36157 
36249 
36341 
36433 
36525 
36617 
36709 



FIG. 12B 
8 of 10 



36801 ATACCTTTTAAATCGATATAGATAGGTGCATGGATGGGTCAAT^^ 

36893 AACTGAGAAATACATAAGCCCAGATTTTGAAAAAATCATTTGGTAGAGTC^ 

36985 TGGCAGGGTGCACGGTAGTGTTAGCT 

37077 CTGGGGTGATCTTTATCTCTTGACTCTACTTGTTAAGCATTTGACTTGTCT 

37169 CTAT GTAAATTGATT TAGTTGTC C TCAT C C C CATAGATGTT TTCCATGTTTTTAGATAATGAGATTT CTGTTGGCTATAGC CAATGGAATAA 

372 61 TAATTAGACTTCTCATAGAAACTAGACTTAAATAATGAATTGATC 

37353 GACATTGTTTAAGGATGCATCTGGAAATCTTGA^ 

374 45 ACATGCTCAAGTTACTTGCAGACTTTCGGAAGTGAGTTTCAAGGT^ 

37537 AACAGGGGAGCAGTCACTTAGGTTGCTCGATTTAGACATC^^ 

3762 9 GATGGGGTGAAAATCAGAAGAATAAC CAGTTAATTT GAATTC T GT AGAGGAT GTTT TGGGTGGTGCTGT GAAGGGTGGACTGGGTAAGGATG 

37721 AGCCTATGGTGGGGAGGAAACAGTTGPdSGAAC 

37813 GACAAATTGA^GTCGTGCCAAAGAGATACGATGACTGGGT^ 

37905 TTGGAAGTTTCTGGCACCAGTTGTTGCTTAGCTTGTGG 

37997 TGTCTAAATTCCCTTCTTAGAACACTAGTGATAAAGC^ 

38089 TTTCCAAATAGGTCACATTTATAGGTACAAAGGGTTGGGACTT 

38181 GGGAGCAAGGAATGT GT GCAACAT CAGAGGGC CGGCAGC TT C C CCAAGTCAGTCTCACC CGAGGGTCTGTGTTC TTAACCT CTAT GCTGTTT 

38273 TGCTGCTACATCC TAAAGAGTTCAC TCTGAACCTTTGAAACTGATTTTCTTTGCTAG 

38365 GGGAATGTGAAAGAGCTGAGGGC GC TAGAAGATGTGAAGT GAAAAGAATAGC TGAGAGC CAAAT GCTAAC TATTC TATGC CAAAGGTATCC T 

3 8=45 7 TGTTTTTTTTTTTTTrTGTGCATATCAAAATAGCAATCTTATCAGTTTGTCm 

3 8M 9 TTACCTTTTTCTTATCTGTCTTX^GTAGTAGGAATAGAAA^ 

3 8^4 1 TTC CTAATCAGAAT C CAATCACGTC CAT GT CAGCAGGCGGC TTCAGC C TT CACAGC GAC GT GAAATC C CTTGTCAAGAGGC TCAAAAAGGTA 

3 8pp 3 GAAAGGATTCTCAAGGTCTCTTTCAGTTATGTGATTATACAGTT^ 

3 8j£2 5 AACCTCAGTTTTGCTAACATGCAGCTAAGGTTGGCCTGTTC^ 

3 7 GCTCACGTAGGGAATTGGAGAAGGGGGAGAGGAGGATAC^ 

3 9JT0 9 GGTGCC CCAAATTC CCAGTCACTATCTGACAGTTTTATGAC C TGGTAAGGACACAGGTC TTGGCCAGGGAGTGC CC CTGGATC C C TATGAAT 

3 9 1 1 0 1 CTGTTATTCATGAAAGACTAAATAAAAGAATAGTACCCT^^ 

3 9J.9 3 CATTTTAGAAATTT T GAGAAATACAGAAGAATAAAAGGATGAAAAAAGGT TTAC TAC T AGTTTTAAC C TTCGTGGTGAAC TTTTGGAGAAC T 

3 SgjJB 5 TTTTTTTTTTTTTTGAGATGGAGTCTCACTCTGTCCCTCAGGCTGGCGTACAGTAGCA 

3 937 7 TTCAAGCGATTCTCCTGCCTCAGCCTCCCAAGTAGCTGO 

3 ^4j6 9 TGGGGTTTCACCATATTGGCCAGGCTGA 

3 <g§6 1 GAGCCACCGTGC C CAGC C TAGGGGGGAACATTTTTTTTTAC GTTTTAT TC C TTTACATTTTATTTTAGT TTATC TTATGTAGC TATGAT CAT 

3 SgsrS 3 ACTAAATATGTAATATTTCCCTGCACAACTCAAGTATT^ 

39745 TTAT7\ATAGTCCATTGAGATAGACCATAGATTATTTAACTCTTCCCCCATTTTTTGACTTTTTTT 

39837 GTCGCCAGGCTGGAGTGCAGTGGC^CCATCTCAGCTCACTGCAACCTCCGCC 

39929 AGCTGGGACTCCTGAGTAGCTGAGTAGCGCATGCTGCCACGCCCCGCTAATTTTTTTTTTTTTTTOT 

4 0021 TACTAAATATCTCACCATCTTGCCCAGGCTGGTCT(^ 

40113 AGGTGTGAGCCACCATGCCCAGCCATTTTTTGACTTTTAATGTGTTTCTGATTTTTCAGAA 

4 0205 AAAAAATCTTCTCTATTGGTAGTGGGTAATATATTATC^TACATACTATATTATCATATAGTAA 

4 0297 ATTTCATTCTTACTAATTTTTTCAAAAACCAGTCACCTTT^^ 

4038 9 TAACCATGAGGTGGGTCTGCGTGTACTTAGGAAAGTGAATAC^^ 

40481 AAGAAAATATATATTTCTTTTGTATGTAAATGAAGAA 

40573 GGATATATGC CAAAT AATCATGGTTATC TC T GAGGGATGGGTT GAT GGGTGATATTT CACC TT C CAC TTTATAACATTCT GTCATTTTTATA 

40 665 TGAACTTTTAAAAAACTAACACTTTTATATTCAGACAA 

40757 TTACTTC CTAGGCTGGTTAGCTTGAGACTTC CCCACAGTGGC GGC TCCTCAGGGGCAGCCC CAGTGCATGGTCCTGTCTTCAGTGGAGGCTG 

408 49 GGGAGTGGGGCTTCACATGGTCACTAATTTGAAAGTGOT 

40941 GAGTTCTCTTCTGCACACCACTTCTTCATGCATGTGCTCAGCAGGAGGGCATT 

41033 AGAAAC C TGAAAACAGGGATC CGAT GGT GACAGCATAGAAGACACAGCAGGATAAGTGAGGCCACGC TC C TCAATAAGTATTCAAAGAAAC T 

41125 TTGGTGCCt^CTCCCCGTATTCTTCACAACAGAGTTAG^ 

41217 CCTTCCTCTGTATATTTTACAGGAAATAATCTCATGTCAGTGGC 

41309 TCTCAAAATGTTATTTAATAATGCATGAAAAAAATTTCTTCACGCTGTCTC^ 
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41401 CTTTTTCC CTTGCAATAATTACC CA.GGGATATCTTC CAAGM 

41493 AGAAGAG^GACTTGGTACTTTCTATTTGA^^ 

41585 TGCGAAGACATGAAGGCAAAGTATAAAAATAGAACGTTTTCTTTAAAACGTAGACCTTT^ 

41677 ATACTACTAAAGTTTTTGCGTATGCAGCTT^ 

41769 GATTTCGCTCTGGTTACACTGO\CT 

418 61 TCTGCCCAAATTGTGTCCCCTTCCTGCCTCATCAAGCAGCACATAAATCACAAA^ 

41953 TCTCAGGGAACTCTACTCATCTTTCAAAGCCC^TCTGTGGCTCACTTCTGTGCTGGG 

42045 AGCCGCCTCCTCTTTTTAAGGGTGGATAATAAC^ 

42137 TCCCACCATGAATACTGATCTCATCCC^ 

42229 GTGATTGCCCTCATATGCCGTAAGTAGCTT^^ 

42321 ATGGTCCTGAGTACATACATTGCAGCAT^ 

4 2413 TGAAACAAAAATATCTCAAATTTCTTTCAATCATATATAGTTGT^ 

42505 AACA(^GTTAGAAGATTAAACTCACCA^ 

425 97 AGGAAGGGAAGACCGGCTGCAGAAACACTTAGATAAGGACCCCAAATCTGTT 

42 689 CCTCTGTGGGAATTGTTGAGTCAGGASAAA 

42781 ACACAGAGACAGGAGCAGTTCCCAGAGGCCAGGCATCC 

4 2873 TCATGTTCAAGGATTATTTTATAAATTT^ 

42965 AAAGAATTCAGTTTGTGTAACACGCACAC^CCACCACTTTGGAAGTGCAT 

4 3 S3 7 TCTAATAAACCAGATTCAACATAAACCATAAACTTTTGAAATGGGTTTGAGAT 

4 3fi 9 ACTTTTAAAAGCCGATGTGCTACATAATAT 

4 3 ?4 1 CTAGAAGTAAAATCTCTCAGCAGAGGGAGGAAATAGAAT^ 

4 3?| 3 CCAGTGAAGATCATTTAATTAAAATATGTTGCTTA 

4 3 M 5 GAGTGATCTTAAAAATGGTGATGAAGATGCCTGTTC^TTCATAGGTGGAAATAATTAGG 

4 351 7 TTATATTTAAAAGTATAATTTGTAATAAA 
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hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



MFPME DISISVI GRQRRTVQ 20 

MTHLNSLDVQLAQELG 16 

MAERRAFAQKISRTVAAEVRKQISGQYSGSPQLLKNLNIVG 41 

MLL FP YDDFQTAI LRRQGRY I CS 23 

MAAS ERRAF AHK I NRTVAAE VRKQVS RERSGS PHS S RRC S S S L 4 3 

MSFRGKVFKREPSEFWKKRRTVRRVIQEEFHRFSSQEKPRLLEPLDYETVIEELEKTYRN 60 



hCLASP4 S T VPE DAEKRAQS L FVKEC I KT YS T DWHWN YK 53 

hCLASPS DFT 19 

hCLASP3 N ISHHTTVPLTEAVDPVDLEDYLITHPLAVCSGPLRDLIEFP 8 3 

hCLASP2 TVPAKAEEEAQSLFVTEC I KT YNS DWHLVNYK 55 

hCLASP7 G VPLTEWEPLDFEDVLLSRPPDAEPGPLRDLVEFP 79 



hCLASPl DPLQDLLFFPSDDFSAATVSWDIRTLYSTVPEDAEHKAENLLVKEACKFYSSQWHVVNYK 120 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
ACLASP7 
"riCLASPl 



YEDFSGDFRMLPCKSLRPEKI PNHVFE I DEDCEKDED SSSLCSQKGGVIKQG 105 

DDDLDWFTPKECRTLQP-SLPEEGVELDPHVR DC VQT Y I RE WL I 63 

PDDIEWYSPRDCRTLVS-AVPEE-SEMDPHVR DCIRSYTEDWAI 126 

YEDYSGEFRQLPNKWKLDKLPVHVYEVDEEVDKDED AAS LGSQKGG I TKHG 107 

ADDLELLLQPRECRTTEP-GI PKD-EKLDAQVR AAVEMYIEDWVI 122 

YEQYSGDIRQLPRAEYKPEKLPSHSFEIDHEDADKDEDTTSHSSSKGGGGAGGTGVFKSG 180 



MfrCLASP4 
"hiCLASPS 
0gCLASP3 
UjCLASP2 
^|CLASP7 
-hCLASPl 



WLHKANVNSTIT — VTMKVFKRRYFYLTQLPDGS Y I LNSYKDEKNSKESK-GC I YLDAC I 162 
VNRKNQGSPEIC — GFKKTGSRKDFHKT-LPKQTFESETLECSEPAAQA — GPRHLNVLC 118 
VIRKYHKLGTGF — NPNTLDKQKERQKG-LPKQVFESDEAPDGNSYQDDQDDLKRRSMSI 183 
WLYKGNMNSAI S — VTMRSFKRRFFHLIQLGDGSYNLNFYKDEKISKEPK-GSI FLDSCM 164 
VHRRYQYLSAAY — SPVTTDTQRERQKG-LPRQVFEQDASGDERSGPEDSNDSRRGSGSP 179 
WLYKGNFNSTVNNTVTVRSFKKRYFQLTQLPDNSYIMNFYKDEKISKEPK-GCIFLDSCT 239 



gCLASP4 
fHCLASPS 
HgCLASP3 
yCLASP2 
HCLASP7 
hCLASPl 



DWQCPKMRRHAF^LKMLDKYSHYLAAETEQE^EWLITLKKIIQINTDSLVQEKKETV^ 222 

DVSGKGPVTACDFDLRSLQPDKRLENLLQQVSAEDFEKQNEEARRTN RQAE 1 69 

DDTPRGSWACSIFDLKNSLPDALLPNLLDRTPNEEIDRQNDDQRKSN RHKE 234 

GWQNNKVRRFAFELKMQDKSSYLLAADSEVEMEEWITILNKI LQLN FEAAMQEK 219 

EDTPRSSGASS I FDLRNLAADSLLPSLLERAAPEDVDRRNETLRRQH RPPA 230 

GWQNNRLRKYAFELKMNDLTYEVIJ^TESDMDEWIHTLNRILQISPEGPLQGRRSTEL 2 99 

it » -k * • • * 

• • • • • • • 



hCLASP4 
hCLASPS 
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hCLASPl 



TAQDDETSS QGKAENIMASLERSMHPELMKYGRETEQLNKLSRGDGRQNLFSFDSE 278 

LFALYPSVD EE DAVE I RPVPEC PKEHLG N RILVKLLTLKFEIE 212 

LFALHPSPD EEE P I ERLSVPD I PKEH FG QRLLVKCLSLKFEIE 277 

RNGDSHEDD EQSKLEGSGSGLDSYLPELAKSAREAEIK LKSESRVKLFYLDPD 272 

LLTLYPAPD EDEAVERCSRPEPPREHFG QR I LVKC LS LKFE I E 273 

TDLGLDSLDNSVTCECTPEETDSSENNLHADFAKYLTETEDTVKTTRNMERLNLFSLDPD 359 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



VQRLDFS GIEPDIKP-FEEKCNKRFLVNCHDLTFNILGQIGDNAKGPPTNVEPFFI 333 

IEPLFAS IALYDVKERKKISENFHCDLNSDQFKGFLRAHTPSVAASSQARSAVFSV 268 

I E P I FAS LALYDVKE KKK I S EN FY FDLNS E^IKGLLRPHVP PAA ITT LARS A I FS I 333 

AQKLDFS SAEPEVKS-FEEKFGKRILVKCNDLSFNLQCCVAENEEGPTTNVEPFFV 327 

IEPIFGI LALYDVREKKK I SENFY FDLNS DSMKGLLRAHGTHPAI S T LARS AI FS V 329 

I DTLKLQKKDLLE PES V I KP FEEKAAKR IM 1 1 CKALNSNLQGCVTENENDP I TN I E P FFV 419 
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hCLASP4 
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hCLASP7 
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hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
HCLASPl 



SyC LASP4 
WfcLASPS 
KCLASP3 
feLASP2 
yCLASP7 
^LASPl 



g|CLASP4 
ifiCLASPS 
^CLASP3 
SCLASP2 
3SCLASP7 
%CLASP1 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



NI^FDVKNNCKISADFHVDLNPPSVREMLWGSSTQLASDGSP KGSSPESYIHGIAE 390 

TYPSSDI YLWKIEKVLQQGD IGDCAEPYTVIKESDG GKSKE-KIEKLKL 317 

TYPSQDVFLVIKLEKVLQQGD IGECAEPYMI FKEADA TKNKE - KLEKLKS 332 

TLSLFDIKYNRKISADFWDLNHFSVRQMLATTSPALMNGS GQSPSVLKGILHE 381 

TYPSPDI FLVIKLEKVLQQGD ISECCEPYMVLKEVDT AKNKE-KLEKLRL 378 

SVALYDLRDSRKISADFHVDLNHAAVRQMLLGASVALENGNIDTITPRQSEEPHIKGLPE 479 

"ic * 4r • * * • * • 

• *•>•• ♦ * • ••• • 

SQLRY IQQGI FSVTNPHPE I FLVARIEKVLQGNI THCAEPYIKNSDPVKTAQKVHRTAKQ 450 

QAESFCQR LGKYRMPFAWAP I SLS S FFNVS TLEREVTDVDS WGRS PVGERRTLA 372 

QADQFCQR LGKYRMP FAWTAI HLMN IVSSAGS LERDSTEVE I STGERKGSWSERR 4 37 

AAMQYPKQG I FS VTCPHPD I FLVAR I EKVLQGS I THCAE P YMKS S DS S KVAQKVLKNAKQ 441 

AAEQFCTR LGR YRM P FAW TAVHLAN I VS S AGQL DRDS D SEGERRPAWTDRR 429 

EWLKFPKQAVFSVSNPHSEIVLVAKIEKVLMGNIASGAEPYIKNPDSNKYAQKILKSNRQ 539 



VCSRLGQYRMPFAWAARPIFKDTQGSLDLDGF 

QSRRLSERALSLEENGVGSNFKTS TLSJVSS 

NSSIVGRRSLERTTSGDDACNLTSFR- 



patlt vtnffkqegdrlsdedlykfladmrrps 
acqrlgqyrmpfawaartlfkdasgnldknar fsai yrqdsnklsnddmlklladfrkpe 

rrgpq — dras sgddacsfsgfr- patlt vtnffkqeaerlsdedlfkfladmrrps 

fcsklgkyrrafawavrsvfkdnqgnvdrdsp|fsplfrqesskistedlvklvsdyrrad 

* • * • 4r • • • 4r t -ic • ^ * • * • • • 



FS PL YKQDS SKLS S EDI LKLLSE YKKPE 
FFKQEGDRLS DEDLFKFLADYKRS S 



510 
427 
496 
501 
483 
599 



--KTKLQIIPGQLNITVECVPVDLSNCITSSYVPLKPFE-KNCQNITVEVEEFVPEMTKY 567 

SLQRRVKSIPGLLRLEISTAPEI INCCLTPEMLPVKPFP-ENRTRPHKEILEFP — TREV 484 

S VLRRLRPI TAQLKI DI S PAPENPHYCLTPELLQVKLYP-DSRVRPTRE I LEFP — ARDV 553 

K-MAKLPVI LGNLDI T IDNVSSDFPNYVNSS Y I PTKQFETCSKTP I T FEVEE FVPC I PKH 560 

S LLRRLRPVTAQLK I DI S PAPENPH FCLS PELLH I KP YP- DPRGRPTKE I LE FP — ARE V 54 0 

R- 1 SKMQTI PGSLDIAVDNVPLEHPNCVTSS FI PVKPFNMMAQTEPTVEVEEFVYDSTKY 658 



* + 



CYPFTIYKNHLYVYPLQLKYDSQKTFAKARNIAVCVEFRDSDESDASALKCIYGKP AGSV 627 
YVPHTVYRNLLYVYPQRLNFVN — KLASARNITIKIQFMCG-EDASNAMPVIFGKS SGPE 541 
YVPNTTYRNLLYIYPQSLNFAN — RQGSARNITVKVQFMYG-EDPSNAMPVIFGKSSCSE 610 
TCPYTIYTNHLYVYPKYLKYDSQKSFAKARNIAICIEFKDSDEEDSQPLKCIYGRP3GPV 62 0 
YAPHTSYRNLLYVYPHSLNFSS — RQGS VRNLAVRVQ YMT G - E D PS QAL P V I FG KS 3 C S E 597 
CRJP YRVYKNQI Y I YPKHLKYDSQKC FNKARNI TVC I E FKNS DEES AKPLKC I YGKP £GPL 718 

•••• • ••••• •*■! • 



+ + 



FTT^ AYAWSHHNQNPEFYDEIKIELPIHLHQKHHLLFTFYHVSCE INTKGTTKKQDT VE 



FLQJ VYTAVTYHNKS PDFYEEVK I KLPAKLTVNHHLLFTFYH I SCC Q KQGA£ 



FSKJ AYTAVVYHNRSPDFHEEIKVKLPATLTDHHHLLFTFYHVSCC Q KQNTE LE 



ftr: 



IIKIELPTQLHEKHHLLLTFFHVSCI NSSKGSTKKRD\ VE 



afaavlhhhqnpe fyde 

ftri aftpvvyhnkspefyeefklhlpacvtenhhllftfyhvsc(3p rpgtale 

fts;aytavlhhsqnpdfsdevkielptqlhekhhilfsfyhvtce 



* • + . * 



+ ★ 



inakanakkke; LE 



VE 



687 
595 
664 
680 
651 
778 



TPVGFAWVPLLKDGRI ITFEQQLPVS ANL P PG YLNLNDAE S RRQCNVEtl KWVDGAKP LLK 747 
TLLGYSWLPILLNERLQTGSYCLPVA LEKLPPMYSMHSAEKVPLQNPP IKWAEGHKGVFN 655 

TPVGYTWI PMLQNGRLKTGQFCLPVS LEKPPQAYSVLSPEVP LPG tfKWVDNHKGVFN 721 

TQVGYSWLPLLKDGRWTSEQHI PVS ANLPSGHLGYQELGMGRHYGPE IKWVDGGKPLLK 740 

T PVG FTW I PLLQHGRLRTGP FCL PVS VDQ PPPSYSVLT PDVA LPG tfRWVDGHKG VFS 708 

TSVGYAWLPLMKHDQIASQEYNIPIATSLPPNYLSFQDSASGKHGGSCIKWVDGGKPLFK 838 
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3ACLASP4 
M&CLASPS 
C&CLASP3 
^iCLASP2 
0§CLASP7 
y&CLASPl 



l£CLASP4 
SlCLASPS 
hfiCLASP3 
ffiCLASP2 
]giCLASP7 
MfiCLASPI 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



FKSHLESTIYTQDLHVHKFFHHCQLIQS GSKEVPGELIKYLKCLHAM 794 

IEVQAVSSVHTQDNHLEKFFTLCHSLESQVTFPIRVLDQKISEMALEHELKLSIICLNSS 715 

VE WAVS S I HTQDP YLDKFFALVNALDE H - L FPVR I GDMR IMENNLENELKS S I S ALNS S 780 

I STHLVSTVYTQDQHLHNFFQYCQKTES GAQALGNELVKYLKSLHAM 787 

VELTAVSSVHPQDPYLDKFFTLVHVLEE3-AFPFRLKDTVLSEGNVEQEL 7 67 

VSTFWSTVNTQDPHVNAFFQECQKREKp MSQSPTSNFIRSCKNLLNVE 887 



* * 



E I QVMIQFLPVTLMQLFR VLTNMTH EDDVP 824 

RLEPLVLFLHLVLDKLFQLSVQPMVIAGQTANFSQFAFESWAIANSLHNSKDLSKDQHG 77 5 
QLEPWRFLHLLLDKLILLVIRPPVIAGQIVNLGQASFEAMASIINRLHKNLEGNHDQHG 84 0 

EGHVMIAFLPTILNQLFR VLT-RAT QEEVA 816 

SPE PLVAFS HHVLDKLV RLVIRPPIISGQI VNLGRGAFEAMAHWS LVHRS LEAAQDARG 827 
KI HAIMSFLPI ILNQLF K VLVQNE EDEIT 916 



INCTMV-LLHIVSKCHEEGLDS YLRSFIKYS FRPEKP 860 

RNCLLASYVHYVFRLPEVQRDVPKSGAPTALLDPRSYHTYGRTSAAAVSSKLLQARVMSS 835 
RNSLLASYIHYVFRLPNTYPNSSSPG-PGGLGGSVHYATMARSAVRPASLNLNRSRSLSN 899 

VNVTRV- 1 IHWAQCHEEGLES HLRSYVKYA YKAEPY 852 

HCPQLAAYVHYAFRLPGTEPSLPDGAPP VTVQAATLARGSGRPASLYLARSKSISS 883 

TTVTRV-LPDIVAKCHEEQLDH SVQSYIKFV FKTRAC 952 



SAPQAQLIH ETLATTMIAI LKQS 883 

SNPDLAGTHSAADEEVKN IMS SKIADRNCSRMSYYCSGSS DAPS SPA 882 

SNPDISGTPTSPDDEVRS I IGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAESTQAMDRSC 959 

VASEYKTVH EELTKSMTTI LKPS 875 

SNPDLAVAPGSVDDEVSRILASKLLHEELA-LQ 915 

KE RPVH EDLAKNVTGLLKSN 972 



ADFLSINKLLKYS WFFFEI IAKSM 

APRPASKKHFHEELALQ MWSTGMVKSM 

>^<MSSHTETSSFLQTLTGRLPTKKLraEELALQWVVCSGSVRESALQQAWFFFELMVKSM 

ADFLTSNKLLRYS W FFFDVL I KSM 

WWSSSAVREAILQHA WFFFQLMVKSM 

DSPTVKHVLKHS WFFFAIILKSM 

* * * * 

* • • ■ 

Cadherin Cleavage 



AT Y LLEENK I KLF RGQR fPETYHHVLHSLLLAIIPHVTIRYAEIPDE SRNVN\ SLAS 

AQHVHNMDKRDS ERRTR ?SDR FMDD ITT IVNWT S E IAALLVKPQKENEQAEKMN I SLAF 

VHHLYFNDKLEAERKSR rPERFMDDIAALVST IASDIVSRFQKDTEM VERLN1 SLAF 

AQHL I ENS KVKLI RNQR fPAS YHHAAE TWNMLMPH I TQK FGDNPEA S KNANf S LAV 

ALHLLLGQRLDTERKLR rPGRFLDDITALVGSVGLEVITRVHKDVEL AEHLN£SLAF 

AQHL I DTNKI QLE RPQR TPESYQNELDNLVMVLSDHVIWKYKDALEE TRRATF SVAR 



+ . ★ 
■ 



FLKRCLTLMDRGFIF tfLINDYISGFSPKDP- 



* ★ * 



HVLAEYKFEFLQTICNHEHYI PLNL 
TL I SMRLE FLRI LCSHEHYLNLNL 

ICSHEHYVTLNL 
HTLFEYKFEFLRVVCNHEHYI PLNL 
FLSDLLSLVDRGFVFJSLVRAHYKQVATRLQSSPNPS ALLTLRMEFTRILCSHEHYVTLNL 

h TLCQYKFDFLQEVCQHEHFI PLCL 



FLYDLLSLMDRGFVF tfLIRHYCSQLSAKLSNL E 

FLNDLLSVMDRGFVFSLIKSCYKQVSSKLYSLPNPgVLVSLRLDFLRI 
FIKRCFTFMDRGFVF <QINNYISCFAPGDP 
FLS DLLS LVDRGFVF S LVRAH 
FLKRC FT FMDRGC VFf<MVNNY ISMFSSGDL 



• * 



* * * 



907 

910 

1019 

899 

942 

995 



964 

970 

1076 

956 

999 

1052 



1019 
1027 
1136 
1011 
1059 
1107 
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Cadherin EC motif 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP? 
hCLASPl 



PMAFAKPKLQR VQDS — NLEYSLSDEYCKHHFLVGI LLRETSI 1060 

FFMNADTAPTSP — CPS ISSQNSSSCSSFQDQKIASMFDLTSEYRQQHFLTGILFTELAA 108 5 

PCSLLTPPASPSPSVSSATSQSSGFSTNVQDQKIANMFELSVPFRQQHYLAGIVLTELAV 1196 

PMPFGKGRIQR YQDL — QLDYSLTDE FCRNHFLVGI LLRE VGT 1052 

PCCPLSPPASPSPSVSSTTSQSSTFSSQAPDPKVTSMFELSGPFRQQHFLAGI.LLTELAL 1119 

PIRSANIPDPLTP SES TQELHASDMPEYSVTNEFCRKHFLIG1 LLREVGF 1157 



ALQDN YqiRYTAI 

ALDAEGEGI 
I 

ALQEFR 



ALEPEAEGAFLLHKKAI 



ALQEDQ : 



S V I KNLL I KHAFDTR YQHKNQQAK I AQL YLP FVGLLLEN I 2RL 



SK VQRKAVSAIHSLLSSHDLDPRCVKPEVKVKIAALYLPLVGI ILDAL ? — 
LDPDAEGLFqLHKKVIHMVHNLLSSHDSDPRYSDPQIKARVAMLYLPLIGIIMETV ? — 

IAISVLKNLLIKHSFDDRYASRSHQARIATLYLPLFGLLIENV2RI 
SAVHSLLCGHDTDPRYAEATVKARVAELYLPLLS IARDTL ? — 



evrl: 



E VRHLALAVLKNLMAKHS FDDRYREPRKQAQI ASLYMPLYGMLLDNM ?RI 



1116 
1143 
1254 
1108 
1177 
1213 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
HCLASPl 



AGRDTLYSCA AMPN-S AS RDE FPCG FT S PANRGS LS TDKDT AYGS 1160 

QL CDFTVADTRRYRTSGSD 1162 

QLY DFTETHNQRGRPICIATDD — 1276 

NVRDVSPFPVNAGMTVKDESLALPA-VNPLVTPQKGSTLDNSLHKDLLGAISGIASPYTT 1167 

RLH DFAEGPGQRSRLASMLDSDTE 12 01 

YLKDLYPFTVNTSNQGSRDDLSTNGGFQSQTAIKHANSVDTSFSKDVLNSIAAFSSIAIS 1273 



&CLASP4 
MtLASPS 
ffCLASP3 
SCLASP2 
kfcLASP7 

m:laspi 



FQ-NGHGIKREDSRGSLIPEGATGFPDQGNTGEN TRQSSTRSSVSQYNRLDQYE 1213 

EEQEGAGAINQNVALAIAGNNFNLKT SGIVLSSLPYKQYNMLNADT 1208 

YESESGSMI SQTVAMAIAGTSVPQLTR PGSFLLTSTSGRQHTTFSAES 1324 

STPNINSVRNADSRGSLISTDSGNSLPERNSEKSNSLDKHQQSSTLGNSWRCDKLDQSE 1227 

GEGDIAGTINPSVAMAIAGGPLAPGSR AS ISQGPPTASRAGCALSAES 1249 

TVNHADSRASLASLDSNPSTNEKSSEKTDNCEKIPRPLALIGSTLRFDRLDQAE 1327 



JfCLASP4 
g|pLASP5 
IhCLASP3 
ftCLASP2 
JgCLASP7 
SsCLASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



1 



drsllmcylyivkmisedtlltywnkvspqelinilillevclfhfrymgkrniar^hda 
rnlmicflwimknadqslirkwiadlpstqlnrildllficvlcfeykgkqssdkvstq 
rsllicllvm.knajdetvlqkwftdlsvlqlnrlldllylcvscfeykgkkvferyinsl 
ksllmc fly i lksmsddalftywnkastselmdfft i sevclhqfqymgkryiar tfqeg 
rtllacvlvfvlkntepallqrwatdltlpqlgrlldllylclaafeykgkkaferfl 
tJrsllmcflhimktisyetliaywqrapspevsdffsildvclqnfrylgkrnii 



R< 



NSL 
IAA 



WLSKHFGIDR— 
VLQKSRDVKAR- 
T FKKS KDMRAK- 
LGPIVHDRKS-- 
TFKKSLDMKAR- 



-KSQTMPALRNRSGVMQARLQHLS S LES S 

LEEALLRGEGARGEMMRRRAPGNDRFPGLNEN 

LEEAILGSIGARQEMVRRSRGQLERSPSGSAFGSQ 

QTLPVSRNRTGMMHARLQQLGSLDNS 

■ LEEAI LGT I GARQEMVRRSRERS P FGNPEN 



AFKFVQSTQNNGTLKGSNPSCQTSGLLAQWMHSTSRHEGHKQHRSQTLPI IRGKN- 



1273 
1268 
1384 
1287 
1309 
1387 



1311 
1311 
1430 
1323 
1350 
1442 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



FFTQCFKTQLL 1359 
IQASS-ALD 1368 
IVyQTVS-VTE 14 89 

1371 

— VRWRKSVTHWKQTSDRVDKTKIfEhffi^ 14 07 

— ALSNPKLLQMLDNTMTSNSNE: DIVHHVDTEANIATEGCLTILDLVSLFrQTHQRQLQ 1500 



FTLNHSSTTTEADIFHQALLEGNTATEVSLTVLDTIS 

— lrwkkeqthwrqanekldktk;£ldqealisgnlateahliildmqeni 

enlrwrkdmthwrqntekldksrj £ I EHEAL I DGNLATEANL 1 1 LDTLE 

LTFNHSYGHSD/DVLHQSLLEANIATEVCLTALDTLSLElTLAFKNQLL 



* *■ * 
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hCLASP4 
hCLASPS 
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^DGKN?I>IKKVFDIH1^FLKNGQSEVSLKHVFASLPAFISKF 1419. 

CKDS LLGGVLRVLVNSLNCDQSTTYLTHCFATLRALIAKFGDLLFEEEVEQCFDLCH 1425 

SKES ILGGVLKVLLHSMACNQSAVYLQHCFATQRALVSKFPELLFEEETEQCADLCL 154 6 

ADHGHNP LMKKVFDVYLC FLQKHQS E TALKNVFTALRS L I YKFPS T FYEGRADMCAALCY 1431 

ARES VLGAVLKWLYSLGSAQSALFLQHGLATQRALVSKFPELLFEEDTELCADLCL 14 64 

QCDCQNSLMKRGFDTYMLFFQVNQSATALKHVFASLRLFVCKFPSAFFQGPADLCGSFCY 1560 



• # 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



EVLKCCTSKISS TRNEASALLYLLMRNNFE YTKRKT FLRTHLQI I IAVSQLIADVALSGG 1479 

QVLHHCSSSMDVTRSQACATLYLLMR — FS FGAT SN FARVKMQVTMSLASLVGRAP DFNE 1483 

RLLRHCSSSIGTIRSHPSASLYLLMR — QNFEIGNNFARVKMQVFMSLSSLVGTSQNFNE 1604 

EILKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTHLQVIISVSQLIADWGIGE 1491 

RLLRHCGSRISTIRTHASASLYLLMR — QN FE I G HN FAR VKMQ VTMS LS S L VG T T QN FS E 1522 

EVLKCCNHRS RS TQT EAS AL L YL FMRKNFE FNKQKS I VRSHLQL I KAVSQL I ADAG- 1 GG 1619 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hp LAS P 7 
%*€LASP1 



KCLASP4 
%tLASP5 
3ICLASP3 
|hpLASP2 
4^CLASP7 
tiCLASPl 



WCLASP4 
WCLASP5 
yCLASP3 
□1CLASP2 
CBCLASP7 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



SR FQE S L FI INNFANS DRPMKATAFPAE VKDLTKRI RTVLMATAQMKEHEKDPEML I DLQ 
EHLRRSLRTIIAYSEEDTAMQMTPFPTQVEELLCNI^SILYDTVKMREFQEDPEML^m 
EFLRRSLKTILTYAEEDLELRETTFPDQVQDLVFNLHMILSDTVKMKEHQEDPEMLIDLM 
TR FQQS LS 1 1 NNCANS DRL I KH TS FS S DVKDLTKR IRTVLMATAQNKEHENDPEMLVDLQ 
EHLRRSLKTILTYA£EDMGLRDSTFAEQVQDLMFNLHMILTDTVKMKEHQEDPEMLIDLM 
SRFQHSIAITNNFANGDKQMKNSNFPAEVKDLTKRIRTVLMATAQMKEHEKDPEMLVDLQ 



* * * + * .* + 



* • 



+ * 



* * * 



* * 



transmembrane 



Y S LAKS YAS TPE LRKT WLDSMAK I HVKNGE FS EAAMC YVHVAAL VAE Fl HRKK 

YRIAKSYQASPDLRLTWLQNMAEKHTKKKC YTEAAMCLVHAAALVAEYLSMLEDH 

YR I AKGYQTS PE-RLTWLQNMAGKHSERSh HAEAAQCLVHS AALVAEYL SMLEDR — 

YS LAKS YAS TPE LRKTWLDSMAR I HVKNGI LS EAAMC YVHVTAL VAE YL TRKG 

YRIARGYQGSPDLRLTWLQNMAGKHAELG* HAEAAQCMVHAAALVAEYL ALLEDQ 

YSLANSYASTPELRRTWLESMAKIHARNGt LSEAAMCYIHIAALIAEYLKRKGYWKVEKI 



* * * 



1539 
1543 
1664 
1551 
1582 
1679 



1592 
1598 
1718 
1604 
1637 
1739 



L FPNGC SAFKK ITPNIDEE GAMKE DAGMMD 1622 

SYLPVGSVS FQNISSNVLEESWSEDTLSPDEDGV 1633 

KYLPVGCVT FQN I SSNVLEES AVSDDWS PDEEG I 17 53 

V FRQGC T A FR V I T PN I DE EASMME DVGMQD 1634 

RHL P VGC VS FQN I S S NVLEE S A ISDDILSPDEEGF 1672 

CTASLLSEDTHPCDSNSLLTTPSGGSMFSMGWPAFLSITPNIKEEGAAKEDSGMHD 1795 



+ ♦ 



* * 



IT AM 



VHYSEEVLLELLEQCVDGLWKAERYEIISEISKLIVPIYEKRREFEKLTQVYRTLHG 

CAGQYFTESGLVGLLEQAAELFSTGGLYETVNEVYKLVIPILEAHREFRKLTLTHSKLQR 
CSGKY FTESGLVG LLEQAAAS FSMAGMYEAVNEVYKVL I P I HEANRDAKKLST I HGKI QE 

VHFNEDVT^MELLEQCADGLWKAERYELIADIYKLIIPIYEKRR 

CSGKHFTE LGLVGLLE QAAG Y FTMGGL YEAVNEVYKNL I P I LEAHRDYKKLAAV HGKI QE 
T P YNENI LVEQL YMCGE FLWKS ERYEL IADVNKP 1 1 AVFEKQRDFKKLS DL YYD I HR 



* * 



1679 
1693 
1813 
1677 
1732 
1852 



IT AM 



DOCK 
i 



motif 



DOCK motif 



J 



L. 



* + + * 



J°lYTK 1 LE VMHTKKRLLGTjFFR^ AEjYGQSjFFEEEtiGKE^ljYKEljKLTGLSE I SLRLVKiFyG 
A FDS ± VNKDH- - KRMFGTjY FR^ G F FG - S KFGDLE EQE E 'V YKE £ AI TKLPE I SHRLEAi YG 
^.FSKIVHQSTGWERMFGTiYFR^GFYG-TKFGDLEEQEEVYKEEAITKLAEISHRLEGIYG 
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^.FTK][MHQSSGWERVFGTjYFR\GFYG-AHFGDLEEQEEVYKEESITKLAEISHRLEEI'YT 
S YLK\ AE WNS EKRL FGHY YR"V AF YGQG FFEEEEGKE^ I YKEEKLTGLSEISQRLLKI YA 
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EKFGTENVKIIQDSDKVNAKELDP? YAHIQ\ T YVKE YFDDKELTERKTEFERNHNISRFV 
QCFGAEFVEVIKDSTPVDKmj^KAYIQI TFVEE YFDEYEMKDRVTYFEKNFNLRRFM 
E R FGE DWEV I KDSNPVDKCKLDP^ KAY I Q 1 T YVE E Y FDT YEMKDR I T Y FDKNYNLRR FM 
DKFGSENVKMIQDSGKVNPKDLDSI' YAYIQ\ T HVII FFDEKELQERKTEFERSHNIRRFM 
EP FGDDWE I IKDS YPVDKSKLDSC KAYIQ] T YVEE YFDTYELKDRVTYFDRNYGLRTFL 
DKFGADNVKI IQDSNKVNPKDLDP? YAYIQ\ T YVTE FFEEKEIEDRKTDFEMHHNINRFV 
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FEAPYTLSGKKQGCIEEQCKRRTILTTSNSFPYVKI , KIPINCEQQI> LKPIDGATDEIKD 
YTTPFTLEGRPRGELHEQYRRNTVLTTMHAFF YIK1 RISVIQKEEFM/TPIEVAIEDMKK 
YCTPFTLDGRAHGELHEQFKRKTILTTSHAFF YIK1 RVNVTHKEEI3 LTPIEVAIEDMQK 
FEMP FT QT GKRQGGVEEQCKRRT I LT AI HC FP YVKf R I P VMYQHHT I LNP I E VAI DEMS K 
FCTPFTPDGRAHGELPEQHKRKTLLSTDHAFP YIK1 RIRVCHREET\ LTPVEVAIEDMQK 
FETPFTLSGKKHGGVAEQCKRRTILTTSHLFPYVK*KIQVISQSST£ LNP I EVAI DEMS R 
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KTAELQKLCSSTDVDMIQLQLKLQC WVS VQVNAG PLAYARAFLNDS QAS KYP PKKVS ELK 
KT LQLAVA I NQE P P DAKMLQMVLQC SVGATVNQGPLEVAQVFLAE I PADPKLYRHHNKLR 



KTQELAFATHQDPADPKMLQMVLQC 
KVAE LRQLC S S AE VDM I KLQLKLQC 
KTRE LAFATEQD P PDAKMLQMVLQC 



SVGTTVNQGPLEVAQVFLSEIPSDPKLFRHHNKLR 
S VSVQVNAG PLAYARAFLDDTNTKRYPDNKVKLLK 
S VG PTVNQG P LE VAQV FLAE I PEDPKLFRHHNKLR 



KVSELNQLCTMEEVDMISLQLKLQG SVSVKVNAGPMAYA&AFLEETNAKKYPDNQVKLLK 
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DMFRKFI QACS ' : ALELNERL I KEDQVE YHEGLKSN FRDMVKELSD 1 1 HEQ I LQEDTMHS P 
LCFKEFIMRCGI lAVEKNKRLITADQREYQQELKKNYNKLKENLRP^IERKI PELYKPI FR 

LCFKDFTKRCEI>ALRKNKSLIGPVQKEYQRELGKLSSP 

EVFRQFVEACG< JALAVNERL I KEDQLE YQEEMKANYREMAKELS E ifclHEQI C PLEEKTS - 
LCFKDFCKKCEDALRKNKALIGPDQKEYHRELERNYCRLREALQPLLTQRLPQLMAPTP- 
EIFRQFADACG(?ALDVNERLIKEDQLEYQEELRSHYKDMLSELSTV MNEQITGRDDLSKR 
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